

0.1 Preface 


Welcome to the Worldwide Integral Calculus textbook; the second textbook from the Worldwide 
Center of Mathematics. 


Our goal with this textbook is, of course, to help you learn Integral Calculus (and power 
series methods) — the Calculus of integration. But why publish a new textbook for this purpose 
when so many already exist? There are several reasons why we believe that our textbook is a
vast improvement over those already in existence.


• Even if this textbook is used as a classic printed text, we believe that the exposition, explanations,
examples, and layout are superior to every other Calculus textbook. We have tried to
write the text as we would speak the material in class; though, of course, the book contains far 
more details than we would normally present in class. In the book, we emphasize intuitive ideas 
in conjunction with rigorous statements of theorems, and provide a large number of illustrative 
examples. Where we think it will be helpful to you, we include proofs, or sketches of proofs, 
in the midst of the sections, but the extremely technical proofs are contained in the Technical 
Matters appendices to chapters, or are contained in referenced external sources. This greatly 
improves the overall readability of our textbook, while still allowing us to give mathematically 
precise definitions and statements of theorems. 


• Our textbook is an Adobe pdf file, with linked/embedded/accompanying video content, annotations,
and hyperlinks. With the videos contained in the supplementary files, you effectively
possess not only a textbook, but also an online/electronic version of a course in Integral Calculus.
Depending on the version of the files that you are using, clicking on the video frame to
the right of each section title will either open an online, or an embedded, or a locally installed
video lecture on that section. The annotations replace classic footnotes, without affecting the
readability or formatting of the other text. The hyperlinks enable you to quickly jump to a
reference elsewhere in the text, and then jump back to where you were.


• The pdf format of our textbook makes it incredibly portable. You can carry it on a laptop
computer, on many handheld devices, e.g., an iPad, or can print any desired pages.


• Rather than force you to buy new editions of textbooks to obtain corrections and minor
revisions, updates of this textbook are distributed free of cost. 


• Because we have no print or dvd costs for the electronic version of this book and/or videos,
we can make them available for download at an extremely low price. In addition, the printed,
bound copies of this text and/or disks with the electronic files are priced as low as possible, to
help reduce the burden of excessive textbook prices.




In this book, we assume you are already familiar with Differential Calculus. Specifically, 
we assume that you know the definition of the derivative of a function, that it represents the 
instantaneous rate of change, and that you know the “rules” for calculating derivatives. We will 
also need l’Hôpital’s Rule and parameterized curves. Referring to the Worldwide Differential
Calculus textbook [2], this means that you should know the contents of Chapters 1 and 2, Section 
3.5, and Appendix A. 


Our discussion of definite integrals, and their applications, is fairly traditional. However, our 
approach to infinite series is somewhat unusual. Our approach is motivated by two factors. First, 
we believe that the primary use that students will have for infinite series, outside of a Calculus 
class, is that many important functions have convergent power series representations, and these 
power series representations allow the student to mathematically manipulate and estimate the 
functions involved, in ways that would be difficult/impossible without power series. Second, 
statistical data that we collected over several years has made it clear that, in general, students 
do not grasp the basic idea that, when x is close to zero, smaller powers of x are more significant
than larger powers of x in a power series or, even, in a polynomial function.


Consequently, we place emphasis on polynomial approximations and power series representations
for functions, and, in a sense, view the classic convergence tests for sequences and series
of constants as the “technical details” required to understand power series. We still include a
chapter, Chapter 5, on sequences and series of constants, but that chapter comes after Chapter 4,
which is on power series and approximating functions with polynomials. We firmly believe
that this ordering of topics is better for the student and for applications, even though it may
seem a bit awkward not to have the rigorous mathematical foundations of sequences and series
come before their use in discussing power series.
This book is organized as follows: 


Other than the Technical Matters sections, each section is accompanied by a video file, which 
is either a separate file, or an embedded video. Each video contains a classroom lecture of the 
essential contents of that section; if the student would prefer not to read the section, he or 
she can receive the same basic content from the video. Each non-technical section ends with 
exercises. The answers to all of the odd-numbered exercises are contained in Appendix C at
the end of the book. 


Important definitions are boxed in green; important theorems are boxed in blue. Remarks,
especially warnings of common misconceptions or mistakes, are shaded in red. Important conventions
or fundamental principles, that will be used throughout the book, are boxed in black.


Very technical definitions and proofs from each section are contained in the Technical Matters 
appendices at the ends of some chapters, or in external sources. Our favorite external technical 
source is the excellent textbook by William F. Trench, Introduction to Real Analysis, [4], which
is available, courtesy of the author, as a free pdf. For producing answers to various exercises or
for help with examples or visualization, you may find the free web site wolframalpha.com very
useful.


Internal references through the text are hyperlinked; simply click on the boxed-in link to 
go to the appropriate place in the textbook. If you have activated the “forward” and “back”
buttons in your pdf-viewer software, clicking on the “back” button will return you to where you 
started, before you clicked on the hyperlink. 


Some terms or names are annotated; these are clearly marked in the margins by little blue
“balloons”. Comments will pop up when you click on such annotated items.


Occasionally, when looking at approximations, we write an equals sign in quotes, as in “=”. 
We use this to denote “equal as far as a calculator is concerned”, i.e., equal to the precision of 
many/most/all calculators. 


We sincerely hope that you find using our modern, multimedia textbook to be as enjoyable
as using a mathematics textbook can be.


David B. Massey 
August 2009 


Chapter 1 


Anti-differentiation: the Indefinite Integral


In this chapter, we discuss anti-differentiation, which is also called indefinite integration. This
is the process for “undoing” differentiation. In the first section, we start with the basic techniques/results,
and then in the remaining sections, we include some more-complicated methods.


The indefinite integral should not be confused with the definite integral, which is the topic 
of the next chapter. The definite integral is the mathematically precise notion of what it means 
to “take a continuous sum of infinitesimal contributions.” The reason that both indefinite and 
definite integration are referred to as “integration” is because calculating continuous sums and 
finding anti-derivatives are related by the Fundamental Theorem of Calculus, Theorem 2.4.10.

1.1 Basic Anti-Differentiation 


This section is about the process and formulas involved in un-doing differentiation, that is, in
anti-differentiating. This means that you are given a function f(x) and are asked to produce
some/all functions F(x) which have f(x) as their derivative. This comes up often in applications,
such as when you're given the acceleration a(t) of an object and want the velocity v(t), or when
calculating definite integrals via the Fundamental Theorem of Calculus (see Theorem 2.4.10).


Since you know differential Calculus, you know what it means to have a function F(x) and
then be asked to calculate its derivative F’(x). For instance, if $F(x) = x^3$, then $F'(x) = 3x^2$.

But what about the “reverse” question? What if you are given the function $f(x) = 3x^2$ and
asked to produce an anti-derivative of f(x), that is, if you are asked to find a function F(x)
whose derivative equals the given f(x)?

Certainly, $F(x) = x^3$ is one anti-derivative of $3x^2$. Are there any others? According to
a corollary to the Mean Value Theorem, the only other anti-derivatives of $3x^2$ are functions
that differ by a constant from the one anti-derivative that we produced, i.e., every other
anti-derivative F(x) of $f(x) = 3x^2$ is of the form $F(x) = x^3 + C$, for some constant C.
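If you would like to see this numerically, the following is a minimal Python sketch that checks, at a few sample points, that x^3 + C has derivative 3x^2 for several values of C. The helper names (`central_difference`, `make_antiderivative`, `max_error`) and the sample points are our own illustrative choices, and the difference quotient only approximates the derivative, so the reported errors are tiny rather than exactly zero.

```python
def central_difference(func, x, h=1e-5):
    """Approximate func'(x) with a symmetric difference quotient."""
    return (func(x + h) - func(x - h)) / (2 * h)


def make_antiderivative(c):
    """Return F(x) = x**3 + c, one member of the family of anti-derivatives of 3x^2."""
    return lambda x: x**3 + c


def max_error(c, sample_points):
    """Largest gap between the approximate F'(x) and f(x) = 3x^2 over the sample points."""
    F = make_antiderivative(c)
    return max(abs(central_difference(F, x) - 3 * x**2) for x in sample_points)


if __name__ == "__main__":
    points = [-2.0, -0.5, 0.0, 1.0, 3.0]
    for c in (0.0, 1.0, -7.5):
        # Changing c changes the function, but not its derivative.
        print(c, max_error(c, points))
```

Every choice of C gives an equally small error, which is the numerical shadow of the fact that the constant disappears when you differentiate.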


Definition 1.1.1. Given a function f(x), defined on an open interval I, a function F(x),
on I, such that F’(x) = f(x) is called an anti-derivative of f(x), with respect to x.

Thus, an anti-derivative y = F(x) of f(x) is a solution to the differential equation
dy/dx = f(x).

If F(x) is an anti-derivative of f(x), on an open interval, then every anti-derivative of
f(x), on that interval, is given by y = F(x) + C, where C is a constant. The collection
y = F(x) + C is called the (general) anti-derivative of f(x), with respect to x; it is the
general solution y to the differential equation dy/dx = f(x).


The notation for the general anti-derivative of f(x), with respect to x, is 


$$\int f(x)\,dx.$$

This is also called the (indefinite) integral of f(x), with respect to x.

Remark 1.1.2. We have several important comments to make.


• First, it is important that $\int f(x)\,dx$ is not one particular function, but it almost is; $\int f(x)\,dx$
is actually a collection, or set, of functions, any two of which differ by a constant.

We write
$$\int 3x^2\,dx = x^3 + C,$$
where including the +C is extremely important, for changing the value of C changes which
element of the set of all anti-derivatives of $3x^2$ you are talking about. Technically, we ought to
write
$$\int 3x^2\,dx = \{x^3 + C \mid C \in \mathbb{R}\},$$
but this is very cumbersome, and no one (well...no one that we know of) ever writes this.


• Second, you should notice that it follows from the definition that the units of $\int f(x)\,dx$ are
the units of f(x) times the units of x.

For instance, if f(x) is in kilograms per cubic meter, and x is in cubic meters, then $\int f(x)\,dx$
is in kilograms.


• Third, you should think of the anti-differentiation, with respect to x, operator, $\int (\ )\,dx$,
as essentially being the inverse operator of $\frac{d}{dx}$, differentiation with respect to x. That is,
the anti-differentiation operator is a compound symbol; it starts with a $\int$, and ends with a
differential, like dx, which, together, tell you to anti-differentiate whatever is in-between with
respect to the variable which appears in the differential.


We wrote “essentially” above because, if you first differentiate and then anti-differentiate, 
you get what you started with, except that there is an additional +C; that is, you end up with 
a collection of functions that all differ by constants, instead of simply the one function that you 
started with. 


• We should also comment on the term “indefinite integral.” There is another notion, called
the definite integral of a function over a closed interval; see Section 2.3. The definite integral is
defined in such a way that it agrees with one’s intuitive idea of what a “continuous sum of infinitesimal
contributions” should mean. This would seem to be unrelated to anti-differentiating.
However, there is a theorem, the Fundamental Theorem of Calculus, which tells us: i)
every continuous function possesses an anti-derivative (Theorem 2.4.7), and ii) the primary step
used to obtain a nice formula for a definite integral is to produce an anti-derivative of the given
function (Theorem 2.4.10).


Hence, anti-differentiation is frequently referred to simply as “integration”, and definite integration
is also simply referred to as “integration”; the context should always make it clear
whether the meaning is anti-differentiation or definite integration. In addition, the symbols for
anti-differentiating $\int (\ )\,dx$ are essentially the same as the symbols used for definite integration.

All of the differentiation formulas which you have learned yield corresponding anti-differentiation
formulas; it’s just a matter of reading things “in reverse”, for, if F’(x) = f(x), then the
corresponding integration rule is $\int f(x)\,dx = F(x) + C$, where C denotes an arbitrary constant.
In this context, f(x) is frequently referred to as the integrand.


For instance, we have a Power Rule for Integration: 


Theorem 1.1.3. For all x in an open interval for which the functions involved are defined,

1. $\int 0\,dx = C$;

2. $\int 1\,dx = x + C$;

3. if $p \neq -1$, $\int x^p\,dx = \frac{x^{p+1}}{p+1} + C$; and

4. $\int x^{-1}\,dx = \int \frac{1}{x}\,dx = \ln|x| + C$.


Remark 1.1.4. In many books, only the third formula above is referred to as the Power Rule
for Integration.

As is frequently the case, you should try to remember this rule not in symbols, but in words;
it says that, as long as the exponent is not $-1$, you obtain the anti-derivative of x raised to a
constant exponent by adding one to the exponent, and dividing by the new exponent (and then
adding a C).
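The rule in words translates almost directly into code. Below is a minimal Python sketch, under our own naming (`power_rule_antiderivative` and `slope` are hypothetical helpers, not anything from the text), that builds one anti-derivative of x^p and treats the exponent −1 as the separate logarithm case. It assumes x > 0 whenever p is not an integer, so that the powers are real numbers.

```python
import math


def power_rule_antiderivative(p):
    """Return one anti-derivative of x**p (the member of the family with C = 0)."""
    if p == -1:
        # "Add one, then divide by the new exponent" would divide by zero here,
        # so this case uses ln|x| instead.
        return lambda x: math.log(abs(x))
    new_exponent = p + 1
    return lambda x: x ** new_exponent / new_exponent


def slope(func, x, h=1e-6):
    """Symmetric difference quotient approximating func'(x)."""
    return (func(x + h) - func(x - h)) / (2 * h)


if __name__ == "__main__":
    x = 2.0
    for p in (3, 0.5, -2, -1):
        F = power_rule_antiderivative(p)
        # The two printed numbers should agree closely: F'(x) versus x**p.
        print(p, slope(F, x), x ** p)
```

The explicit branch for p = −1 is the design point: it is there because the general formula is simply undefined for that exponent.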


Remark 1.1.5. There are two fairly common, horrific mistakes associated with the integration
rule $\int x^{-1}\,dx = \int \frac{1}{x}\,dx = \ln|x| + C$.

The first big mistake is to treat the $p = -1$ case in the same manner as the cases where
$p \neq -1$. If you were to do this, you would obtain
$$\int x^{-1}\,dx = \frac{x^{-1+1}}{-1+1} + C = \frac{x^0}{0} + C.$$
The undefined division by 0 should immediately tell you that you’ve done something wrong,
and remind you that you must treat the $p = -1$ case differently.


The second big mistake may come later, when we have more integration rules. It will then
be tempting to look at the formula $\int \frac{1}{x}\,dx = \ln|x| + C$ and think that it implies that
$$\int \frac{1}{\text{anything}}\,dx = \ln|\text{anything}| + C.$$
This is completely wrong (in general); it is not true that the derivative of the expression
on the right would be the integrand. The problem is that, if you differentiate the expression
on the right, you do, in fact, get a 1/anything factor, but then the Chain Rule tells you that
that is multiplied by d(anything)/dx.
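To make the second mistake concrete, here is a small Python sketch in which “anything” is taken to be x^2 + 1; that choice, and the function names, are our own assumptions for illustration. It compares the approximate derivative of ln|x^2 + 1| with the integrand 1/(x^2 + 1) and with what the Chain Rule predicts.

```python
import math


def slope(func, x, h=1e-6):
    """Symmetric difference quotient approximating func'(x)."""
    return (func(x + h) - func(x - h)) / (2 * h)


def tempting_guess(x):
    """The tempting (wrong) candidate anti-derivative of 1/(x^2 + 1)."""
    return math.log(abs(x**2 + 1))


def integrand(x):
    """The function we were trying to anti-differentiate."""
    return 1 / (x**2 + 1)


def chain_rule_derivative(x):
    """Chain Rule: the derivative of ln|x^2 + 1| is (1/(x^2 + 1)) times 2x."""
    return (1 / (x**2 + 1)) * (2 * x)


if __name__ == "__main__":
    x = 1.0
    print(slope(tempting_guess, x), chain_rule_derivative(x), integrand(x))
```

At x = 1 the derivative of the guess matches the Chain Rule value and misses the integrand by the extra factor 2x, which is exactly the d(anything)/dx factor described above.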


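Example 1.1.6. Find the function P(r), with domain r > 0, such that
$$\frac{dP}{dr} = \sqrt{r} \quad\text{and}\quad P(9) = -7.$$

Solution:

We find that
$$P = \int \sqrt{r}\,dr = \int r^{1/2}\,dr = \frac{r^{3/2}}{3/2} + C = \frac{2}{3}r^{3/2} + C.$$
We need to determine C. We have
$$-7 = P(9) = \frac{2}{3}\cdot 9^{3/2} + C = 18 + C.$$
Therefore, $C = -25$, and so
$$P = \frac{2}{3}r^{3/2} - 25.$$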


The linearity of the derivative gives us the linearity of the anti-derivative. 


Theorem 1.1.7. (Linearity of Anti-differentiation) If a and b are constants, not both
zero, then
$$\int \big(a f(x) + b g(x)\big)\,dx = a\int f(x)\,dx + b\int g(x)\,dx.$$


Remark 1.1.8. The prohibition against a = b = 0 in Theorem 1.1.7 is there for just one reason:
we do not want both of the arbitrary constants on the right to be eliminated by multiplying by
zero. We will explain this more fully.

Since each indefinite integral actually yields a set, or collection, of functions, there may be
some question in your mind about what it means to multiply a set of functions by a constant,
like a or b, and what it means to add two such sets. In other words, you may wonder exactly
what the right-hand side of the equality in Theorem 1.1.7 means.


For instance, what does it mean to write that
$$\int \big(5x^3 - 7\sqrt{x}\big)\,dx = 5\int x^3\,dx - 7\int \sqrt{x}\,dx\,?$$
We know, from the Power Rule for Integration, that $\int x^3\,dx = x^4/4 + C_1$, and
$$\int \sqrt{x}\,dx = \int x^{1/2}\,dx = \frac{x^{3/2}}{3/2} + C_2 = 2x^{3/2}/3 + C_2,$$
where we have used $C_1$ and $C_2$, in place of using simply C twice, since we don’t want to assume
that we have to pick the two arbitrary constants to be the same thing.


So, what does $5\int x^3\,dx$ mean? It means the collection of functions obtained by taking 5
times any function from the collection of functions $x^4/4 + C_1$; that is, the collection of functions
$5x^4/4 + 5C_1$, where $C_1$ could be any constant. But, if $C_1$ can be anything, then $5C_1$ can be
anything, and we might as well just call it $B_1$, where $B_1$ can be any number. Thus, we can
write the collection of functions $5\int x^3\,dx$ as $5x^4/4 + B_1$.

However, here’s the part that can be confusing; instead of using a new constant name, like
$B_1$, it is fairly standard to just use the name $C_1$ again, i.e., to use $C_1$ to now denote 5 times
the old value of $C_1$. Assuming that we had not determined some value for the old $C_1$, there is
no harm in doing this, but it certainly can make things look confusing, for you frequently see
calculations like
$$5\int x^3\,dx = 5(x^4/4 + C_1) = 5x^4/4 + C_1,$$
where the $C_1$ on the far right above is actually 5 times the $C_1$ in the middle.


Similarly, we write
$$-7\int \sqrt{x}\,dx = -7\int x^{1/2}\,dx = -7(2x^{3/2}/3 + C_2) = -14x^{3/2}/3 + C_2.$$
Therefore,
$$5\int x^3\,dx - 7\int \sqrt{x}\,dx = 5x^4/4 + C_1 - 14x^{3/2}/3 + C_2 = 5x^4/4 - 14x^{3/2}/3 + (C_1 + C_2) = 5x^4/4 - 14x^{3/2}/3 + C,$$
where $C = C_1 + C_2$ can be any real number.


Using this example as a guide, you can see what to do more generally: whenever you have 
a linear combination of indefinite integrals, i.e., a sum of constants multiplied times indefinite 
integrals, you do not include an arbitrary constant for each individual indefinite integral; instead, 
for each indefinite integral you write one particular anti-derivative, and then put in a single +C 
at the end.

The fact that an indefinite integral is actually a collection of functions can lead to seemingly
bizarre results. For instance, while it’s true that $\int x\,dx = \int x\,dx$, it would nonetheless be
bad to write that $\int x\,dx - \int x\,dx = 0$. Why? Because our operations on sets of functions
tell us that the correct calculation is
$$\int x\,dx - \int x\,dx = x^2/2 + C_1 - (x^2/2 + C_2) = C_1 - C_2 = C,$$
which is the same as $\int 0\,dx$.

This agrees with what we wrote above: when you have a linear combination of indefinite
integrals, you should use one particular anti-derivative for each integral, and then add a C
at the end. Thus, in the above calculation, you should get $x^2/2 - x^2/2 + C$, which is just C.


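Example 1.1.9. Calculate the indefinite integral
$$\int \left(\frac{5}{w} - 3 + 7w^3 + 5\sqrt[9]{w}\right) dw.$$

Solution:

We calculate
$$\int \left(\frac{5}{w} - 3 + 7w^3 + 5\sqrt[9]{w}\right) dw = \int \left(5\cdot\frac{1}{w} - 3 + 7w^3 + 5w^{1/9}\right) dw =$$
$$5\ln|w| - 3w + 7\cdot\frac{w^4}{4} + 5\cdot\frac{w^{(1/9)+1}}{(1/9)+1} + C =$$
$$5\ln|w| - 3w + \frac{7w^4}{4} + \frac{9}{2}w^{10/9} + C.$$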
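One way to check an indefinite integral is to differentiate the result and compare with the integrand. The short derivation below does that for the anti-derivative $5\ln|w| - 3w + \frac{7w^4}{4} + \frac{9}{2}w^{10/9} + C$ of Example 1.1.9, assuming $w \neq 0$ so that $\ln|w|$ is differentiable; it is offered as a verification sketch rather than as part of the solution.

```latex
\begin{align*}
\frac{d}{dw}\left(5\ln|w| - 3w + \frac{7w^4}{4} + \frac{9}{2}w^{10/9} + C\right)
  &= \frac{5}{w} - 3 + \frac{7\cdot 4w^3}{4} + \frac{9}{2}\cdot\frac{10}{9}\,w^{1/9} + 0 \\
  &= \frac{5}{w} - 3 + 7w^3 + 5w^{1/9}.
\end{align*}
```

The last line is the original integrand, since $w^{1/9} = \sqrt[9]{w}$.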


Example 1.1.10. Suppose that an object moves in a straight line with constant acceleration a
meters per second per second. Show that the position of the object p = p(t), in meters, is given
by
$$p = at^2/2 + v_0 t + p_0,$$
where $p_0$ is the initial position of the object (i.e., the position at t = 0), in meters, $v_0$ is the
initial velocity in m/s, and t is the time in seconds.


Solution: 


Acceleration a is the derivative, with respect to t, of the velocity v, i.e., a = dv/dt. This is
exactly the same as writing that v is an anti-derivative of a, with respect to t, i.e., $v = \int a\,dt$.

Since a is a constant, we find
$$v = \int a\,dt = a\int 1\,dt = at + C \text{ m/s},$$
for some constant C. Therefore, v = at + C, but we would like to give some physical meaning
to the constant C.


How do we do this? We are not given any other data. The answer is that we use tautological
initial data; that is, we use initial data that is simply obviously true. We use that, when t = 0,
the velocity v equals $v_0$. Why is this true? Because it says something that’s clearly true: at time
0, the velocity equals the velocity at time 0. It may seem strange, but using this tautological
initial data actually gets us somewhere.

We have v = at + C. Now, plug in that, when t = 0, $v = v_0$. You find $v_0 = a\cdot 0 + C$, i.e.,
$C = v_0$. Thus, we conclude that $v = at + v_0$. Notice that no one has to give you any extra data
to conclude that $C = v_0$; it follows from the equation v = at + C.


Now, we have that $v = at + v_0$, and we know that the velocity v equals the rate of change of
p, with respect to time, i.e., v = dp/dt. This is the same as writing that $p = \int v\,dt$. Therefore,
we have
$$p = \int v\,dt = \int (at + v_0)\,dt = a\int t\,dt + v_0\int 1\,dt = at^2/2 + v_0 t + C \text{ meters},$$
where this C is definitely not the same C that we used in the equation for v.

How do we find this new C? We plug in more tautological initial data, namely that $p = p_0$
when t = 0. We find that $p_0 = a(0)^2/2 + v_0(0) + C$, and so $C = p_0$. Thus, we conclude
$$p = at^2/2 + v_0 t + p_0 \text{ meters}.$$
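As an informal cross-check of the formula p = at^2/2 + v_0 t + p_0, the Python sketch below starts at p_0 and adds up (velocity) × (small time step) over many short steps, using v = at + v_0. The function names, the step count, and the sample numbers are our own assumptions; sampling the velocity at the midpoint of each step is a choice we made because it happens to be exact, up to rounding, when the velocity is a linear function of time.

```python
def position_formula(a, v0, p0, t):
    """Closed-form position from the example: p = a t^2 / 2 + v0 t + p0."""
    return a * t**2 / 2 + v0 * t + p0


def position_by_accumulation(a, v0, p0, t, steps=1000):
    """Start at p0 and add velocity * dt over many short time steps.

    The velocity v = a*t + v0 is sampled at the midpoint of each step.
    """
    dt = t / steps
    p = p0
    for k in range(steps):
        t_mid = (k + 0.5) * dt
        p += (a * t_mid + v0) * dt
    return p


if __name__ == "__main__":
    a, v0, p0, t = 2.0, 3.0, -1.0, 5.0
    print(position_formula(a, v0, p0, t))
    print(position_by_accumulation(a, v0, p0, t))
```

The two printed values agree to within rounding error, which is what you would expect if position really is an anti-derivative of velocity with the constant fixed by the initial position.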


Other integration formulas obtained at once from differentiation formulas are:

Theorem 1.1.11. As functions on the entire real line $(-\infty, \infty)$, we have

1. $\int \cos x\,dx = \sin x + C$;

2. $\int \sin x\,dx = -\cos x + C$; and

3. $\int e^x\,dx = e^x + C$.


Note that the integration formulas for sin and cos have the negative sign in the “opposite” 
place from the differentiation formulas. This frequently leads to confusion. It shouldn’t. 
Remember: you are finding anti-derivatives. This means that the derivative of what you 
end up with should be what you started with (i.e., the integrand). 


Let’s look at another example in which you are given the acceleration of an object, and are
asked to find the velocity and position, but, this time, we have an acceleration which is not
constant.


Example 1.1.12. Suppose that the acceleration, in $\text{m/s}^2$, of an object moving in a straight
line is a = sin t, where t is the time in seconds. Find the velocity and position of the object, as
functions of time, in terms of the initial velocity and initial position.

Solution:

We find that
$$v = \int a\,dt = \int \sin t\,dt = -\cos t + C,$$
and, plugging in the tautological initial data, we find $v_0 = -\cos(0) + C = -1 + C$. Thus,
$C = v_0 + 1$, and
$$v = -\cos t + v_0 + 1 \text{ m/s}.$$
Now, we integrate the velocity to find the position:
$$p = \int v\,dt = \int (-\cos t + v_0 + 1)\,dt = -\int \cos t\,dt + (v_0 + 1)\int 1\,dt = -\sin t + (v_0 + 1)t + C.$$
Finally, we plug in the tautological initial data, in order to give physical meaning to this last
C:
$$p_0 = -\sin(0) + (v_0 + 1)(0) + C.$$
Therefore, $C = p_0$, and we find
$$p = -\sin t + (v_0 + 1)t + p_0 \text{ meters}.$$
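A result like this can be checked in both directions: the formulas v = −cos t + v_0 + 1 and p = −sin t + (v_0 + 1)t + p_0 should reproduce the initial data at t = 0, and differentiating them should give back a = sin t and v, respectively. The Python sketch below does this numerically; the function names and the sample values v_0 = 2, p_0 = 5, t = 1.3 are our own illustrative assumptions.

```python
import math


def velocity(t, v0):
    """Velocity from the example: v = -cos t + v0 + 1."""
    return -math.cos(t) + v0 + 1


def position(t, v0, p0):
    """Position from the example: p = -sin t + (v0 + 1) t + p0."""
    return -math.sin(t) + (v0 + 1) * t + p0


def slope(func, t, h=1e-5):
    """Symmetric difference quotient approximating func'(t)."""
    return (func(t + h) - func(t - h)) / (2 * h)


if __name__ == "__main__":
    v0, p0, t = 2.0, 5.0, 1.3
    # Initial data: these should print v0 and p0.
    print(velocity(0.0, v0), position(0.0, v0, p0))
    # dp/dt should match v, and dv/dt should match a = sin t.
    print(slope(lambda s: position(s, v0, p0), t), velocity(t, v0))
    print(slope(lambda s: velocity(s, v0), t), math.sin(t))
```

Note that the constant in v is v_0 + 1, not v_0; the check at t = 0 is what catches a slip there.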


We shall not rewrite, as anti-differentiation formulas, every one of the differentiation formulas
that you should know; the standard anti-differentiation formulas are contained in Appendix B.
However, we'll go ahead and give two more, before looking at the Chain Rule and the Product
Rule in their indefinite integral forms.


Theorem 1.1.13. As functions on the open interval $(-1, 1)$,
$$\int \frac{1}{\sqrt{1 - x^2}}\,dx = \sin^{-1}x + C.$$
As functions on the open interval $(-\infty, \infty)$,
$$\int \frac{1}{1 + x^2}\,dx = \tan^{-1}x + C.$$


Example 1.1.14. Calculate
$$\int \frac{7z^2 + 5}{z^2 + 1}\,dz.$$

Solution:

We begin by “simplifying” via long division of polynomials, except the division is not so long
in this case. We get clever and write $7z^2 = 7(z^2 + 1) - 7$, and find
$$\frac{7z^2 + 5}{z^2 + 1} = \frac{7(z^2 + 1) - 7 + 5}{z^2 + 1} = 7 - \frac{2}{z^2 + 1}.$$
Thus,
$$\int \frac{7z^2 + 5}{z^2 + 1}\,dz = \int \left(7 - \frac{2}{z^2 + 1}\right) dz = 7\int 1\,dz - 2\int \frac{1}{z^2 + 1}\,dz = 7z - 2\tan^{-1}z + C.$$
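As a check on the anti-derivative $7z - 2\tan^{-1}z + C$, you can differentiate it and recombine over a common denominator; the derivation below is a verification sketch that uses only the derivative of $\tan^{-1}z$.

```latex
\begin{align*}
\frac{d}{dz}\left(7z - 2\tan^{-1}z + C\right)
  &= 7 - \frac{2}{z^2 + 1} \\
  &= \frac{7(z^2 + 1) - 2}{z^2 + 1} \\
  &= \frac{7z^2 + 5}{z^2 + 1}.
\end{align*}
```

This is the integrand we started with, so the “clever” rewriting of $7z^2$ did not change the function.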


Substitution: the Chain Rule in anti-derivative form: 


The Chain Rule for differentiation tells you how to differentiate a composition of functions.
If f and g are differentiable, then $(f(g(x)))' = f'(g(x))\,g'(x)$ or, letting u = g(x), the Chain
Rule can be written as
$$\frac{d}{dx}\big(f(u)\big) = f'(u)\frac{du}{dx}.$$


As an anti-derivative formula, this becomes 


Theorem 1.1.15. (Integration by Substitution) If f and g are differentiable functions,
then
$$\int f'(g(x))\,g'(x)\,dx = f(g(x)) + C,$$
or, letting u = g(x),
$$\int f'(u)\frac{du}{dx}\,dx = \int f'(u)\,du = f(u) + C.$$

The second formula for substitution is particularly easy to use; it looks as though the dx’s
cancel, as in multiplying fractions. This is not what’s happening, but it does make substitution
easier to remember. It also means that the differential notation
$$du = \frac{du}{dx}\,dx,$$
which we introduced in [2], yields correct formulas in integrals, and, hence, we use it extensively.


Example 1.1.16. Calculate the integral
$$\int \cos(e^x + 7)\cdot e^x\,dx.$$

Solution:

How should you approach an integral like
$$\int \cos(e^x + 7)\cdot e^x\,dx\,?$$
First, you should realize that it’s not just a linear combination of integrals of specific functions
that you’ve memorized. You should then think “Well...$\cos(e^x + 7)$ is a composition of functions.
What if I make a substitution for the ‘inside’ function? I’ll let $u = e^x + 7$, so that $\cos(e^x + 7)$
becomes cos u, and see if the remaining part of the integrand looks like du.”

In fact, for easy cases, you can do this in your head. If $u = e^x + 7$, then
$$du = \frac{du}{dx}\,dx = e^x\,dx,$$
and you see that this is the remaining “factor” in the integral. Thus, by substitution, our original
integral is transformed into an integral in terms of u that is very simple:
$$\int \cos(e^x + 7)\cdot e^x\,dx = \int \cos u\,du = \sin u + C = \sin(e^x + 7) + C.$$
Nice.
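If you want reassurance that the substitution produced the right answer, you can compare the derivative of sin(e^x + 7) with the integrand cos(e^x + 7)·e^x at a few points. The Python sketch below does this with a difference quotient; the function names and the sample points are our own assumptions, and the comparison is approximate rather than exact.

```python
import math


def integrand(x):
    """The integrand cos(e^x + 7) * e^x."""
    return math.cos(math.exp(x) + 7) * math.exp(x)


def antiderivative(x):
    """The result of the substitution u = e^x + 7, taking C = 0."""
    return math.sin(math.exp(x) + 7)


def slope(func, x, h=1e-6):
    """Symmetric difference quotient approximating func'(x)."""
    return (func(x + h) - func(x - h)) / (2 * h)


if __name__ == "__main__":
    for x in (-1.0, 0.0, 0.5, 1.0):
        # The two columns should agree closely at every sample point.
        print(x, slope(antiderivative, x), integrand(x))
```

The e^x factor that appears when you differentiate is the Chain Rule at work, which is the same factor that played the role of du in the substitution.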
Example 1.1.17. Calculate the integral
$$\int \frac{t}{1 + t^2}\,dt.$$

Solution:

You once again realize that this integral is not some linear combination of basic integrals
that you have memorized, and there’s no obvious composition of functions this time. You might
think that $\tan^{-1}t$ is involved somehow, since $\int 1/(1 + t^2)\,dt = \tan^{-1}t + C$, but the t in the
numerator causes a problem.

You might think that you can factor the integrand and use that in some way:
$$\int \frac{t}{1 + t^2}\,dt = \int t\cdot\frac{1}{1 + t^2}\,dt = \left(\int t\,dt\right)\left(\int \frac{1}{1 + t^2}\,dt\right) = \frac{t^2}{2}\tan^{-1}t + C.$$
THIS IS COMPLETELY WRONG. You can’t just integrate each factor in a product
and then multiply the results; dealing with products in an integral is more complicated than
that.

We will get to the integral form of the Product Rule shortly, but, for now, you should
differentiate $t^2\tan^{-1}t/2$ (using the Product Rule) and verify that you don’t get anything close
to our integrand $t/(1 + t^2)$.


Great. Now we know one way NOT to find the integral $\int t/(1 + t^2)\,dt$. We also know that
we’re discussing substitution here, and so you should suspect that a substitution is involved.

With practice, you should actually see relatively quickly that if you let $w = 1 + t^2$, then
dw = 2t dt, and we have a t dt in the integral, so this substitution might be good. We can
always “fix” multiplying by a constant, like the 2 in dw = 2t dt. We divide by 2 to get
$$\frac{1}{2}\,dw = t\,dt,$$
and
$$\int \frac{t}{1 + t^2}\,dt = \int \frac{1}{1 + t^2}\cdot t\,dt = \int \frac{1}{w}\cdot\frac{1}{2}\,dw = \frac{1}{2}\int \frac{1}{w}\,dw = \frac{1}{2}\ln|w| + C = \frac{1}{2}\ln|1 + t^2| + C = \frac{1}{2}\ln(1 + t^2) + C,$$
where the last equality follows from the fact that $1 + t^2 > 0$ (in fact, $1 + t^2 \geq 1$).

It would be a good exercise for you to differentiate our final answer above, and see how the
Chain Rule comes into play to produce our initial integrand.
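For readers who want to compare their work on that exercise, here is one way the differentiation of $\frac{1}{2}\ln(1 + t^2) + C$ can go; it is a sketch of the check, with the Chain Rule factor written out explicitly.

```latex
\begin{align*}
\frac{d}{dt}\left(\frac{1}{2}\ln(1 + t^2) + C\right)
  &= \frac{1}{2}\cdot\frac{1}{1 + t^2}\cdot\frac{d}{dt}\left(1 + t^2\right) \\
  &= \frac{1}{2}\cdot\frac{1}{1 + t^2}\cdot 2t \\
  &= \frac{t}{1 + t^2}.
\end{align*}
```

The factor 2t from the Chain Rule cancels the 1/2, which is the same constant that was “fixed” when dw = 2t dt was divided by 2.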


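By making the substitution u = x/a, so that x = au, we easily obtain:

Theorem 1.1.18. Suppose that a > 0 is a constant. As functions on the open interval
$(-a, a)$,
$$\int \frac{1}{\sqrt{a^2 - x^2}}\,dx = \sin^{-1}\left(\frac{x}{a}\right) + C.$$
As functions on the open interval $(-\infty, \infty)$,
$$\int \frac{1}{a^2 + x^2}\,dx = \frac{1}{a}\tan^{-1}\left(\frac{x}{a}\right) + C.$$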


Integration by Parts: the Product Rule in anti-derivative form: 


The Product Rule as an anti-derivative formula is

Theorem 1.1.19. (Integration by Parts) If f and g are differentiable functions, then
$$\int f(x)g'(x)\,dx + \int g(x)f'(x)\,dx = f(x)g(x) + C,$$
or, letting u = f(x) and v = g(x),
$$\int u\,dv + \int v\,du = uv + C,$$
$$\int u\,dv = uv - \int v\,du.$$


It is the last formula above that most people memorize as THE formula for integration by 


parts. 


Here’s how a basic integration by parts attack on a problem goes. You look at your integral,
and make a choice that lets you write the integral in the form $\int u\,dv$. Then, you apply the
integration by parts formula to obtain that your integral equals $uv - \int v\,du$. Then, you hope
that the new integral $\int v\,du$ is easier (or, at least, no harder) to integrate than the integral that
you started with.


Example 1.1.20. Calculate
$$\int ze^z\,dz \quad\text{and}\quad \int z^2e^z\,dz.$$

Solution:

You should look at $\int ze^z\,dz$, and realize quickly that $ze^z$ does not result from one of the
basic derivative formulas that you should have memorized, and so this integration is not a basic
one that you should know immediately. In addition, if you think for a minute or so, you should
be able to convince yourself that there’s no substitution that will help. However, we do see that
the integrand is the product of two very different-looking functions, z and $e^z$; this is a hint that
integration by parts may be good to use.

Now what do you do? You identify some factor in the integrand that will be u; that factor
should not contain the differential (here, dz). The remaining part of the integrand, together
with the differential, should be dv. With u and dv determined, you calculate du and $v = \int dv$.
You do not need to include an arbitrary constant in your calculation of v; we need some v, not
all possible v’s. Then, you write $uv - \int v\,du$, look at your new integral, and hope that it’s easier.

Let’s see how this works for $\int ze^z\,dz$. There are two obvious choices for u: either u = z
or $u = e^z$. Either one of these will lead to a formula that is true, but only one will lead to
something useful.


Let’s look at the bad choice first, so that you can see how you can tell when you’ve made
a bad choice. Let’s try $u = e^z$. That leaves z dz to be dv. Now, if $u = e^z$ and dv = z dz, then
$du = e^z\,dz$ and $v = \int dv = \int z\,dz = z^2/2$, where, as we discussed, we don’t include a +C in our
calculation of v. Applying the integration by parts formula, we find
$$\int ze^z\,dz = \int e^z z\,dz = \int u\,dv = uv - \int v\,du = (e^z)(z^2/2) - \int (z^2/2)\,e^z\,dz. \qquad (1.1)$$
This is true, but not particularly helpful for calculating $\int ze^z\,dz$. The power of z went up, and,
after moving the constant 1/2 outside the integral, the rest of the integral, the $e^z$, is the same
as what we started with. You should realize that the new integral is harder to deal with than
the original. We could try to integrate by parts again, but, we leave it as an exercise for you to
verify, depending on your new choice of u, that either the power of z goes up again (and the $e^z$
remains), or the power of z goes back down to 1, but you end up with exactly the integral that
we started with.


So, let’s make the other choice for u in integrating by parts to calculate [ ze* dz. Let u = z, 
which means that dv = e* dz. Then, we find that du = dz and v = [ du = f[ e* dz = e*. Thus, 


we obtain 
peu = [ude = wf vdu = 


ze — f cds = ze*—-—e* +C. 


Why did this choice of $u$ work better than our earlier one? Before, when we picked $u$ to not include the power of $z$, the power of $z$ was left in $dv$; when we integrated $dv$ to get $v$, the power of $z$ went up. Now, when we pick $u$ to be the power of $z$, namely $z^1$, the power of $z$ goes down in the calculation of $du$. For this reason, it is frequently (but not always; see the next example) a good idea in integration by parts problems which include powers of the variable to let the power of the variable be $u$.

With this in mind, how do you integrate $\int z^2 e^z\,dz$? You integrate by parts, letting $u = z^2$, which means $dv = e^z\,dz$. What you should get is actually exactly what we got in Formula 1.1, except you need to multiply Formula 1.1 by 2, and rearrange, to obtain:

\[
\int z^2 e^z\,dz = z^2 e^z - 2\int z e^z\,dz.
\]


Now, even if we had not already calculated $\int z e^z\,dz$, you should realize that the new integral on the right above is easier than the one you started with, and you would calculate it by a second integration by parts (if we had not already done so). What we find is

\[
\int z^2 e^z\,dz = z^2 e^z - 2\int z e^z\,dz = z^2 e^z - 2(z e^z - e^z + C) = z^2 e^z - 2z e^z + 2e^z + C,
\]

where the final $C$ is a new arbitrary constant. It might be instructive to differentiate the final result above, on the right, and verify that you obtain $z^2 e^z$.
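The suggested check can be sketched as follows. This is our own working, not taken from the text; it uses only the product rule on each term of the final result:

```latex
\begin{align*}
\frac{d}{dz}\left(z^2 e^z - 2z e^z + 2e^z + C\right)
  &= \left(2z e^z + z^2 e^z\right) - \left(2e^z + 2z e^z\right) + 2e^z \\
  &= z^2 e^z .
\end{align*}
```

The $2ze^z$ and $2e^z$ terms cancel in pairs, which is the same cancellation that the two rounds of integration by parts produced in reverse.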


Example 1.1.21. Calculate

\[
\int t^5 \ln t\,dt \quad\text{and}\quad \int \ln t\,dt.
\]


Solution: 


You look at $\int t^5 \ln t\,dt$, and you realize immediately that this doesn’t come from one of our basic derivative/integral formulas. You might think about a substitution, like $w = \ln t$, for a minute or so, but then realize that it doesn’t get you anywhere. Then, you decide that, since the integrand is the product of two different kinds of functions, maybe integration by parts would be a good thing to try.

If you look at our previous example, you’d probably be tempted to let $u$ be the power of $t$, i.e., let $u = t^5$, which would lead to $dv = \ln t\,dt$. However, this is one of those times when picking the power of the variable to be $u$ is a bad idea. Perhaps the most obvious reason why this is bad is that we don’t know how to calculate $v = \int dv = \int \ln t\,dt$. You may think that we’ve discussed this integral, and that it equals $1/t + C$. This is very wrong. We know that $(\ln t)' = 1/t$ or, what’s the same thing, $\int 1/t\,dt = \ln t + C$ (for $t > 0$), but we don’t know $\int \ln t\,dt$ (it’s possible that you do, but it hasn’t been discussed in the book up to this point). In fact, calculating $\int \ln t\,dt$ is the second part of this example.

So, to calculate $\int t^5 \ln t\,dt$, we’ll try integration by parts with $u = \ln t$, which means that $dv = t^5\,dt$.


We find that $du = (1/t)\,dt$ and $v = \int dv = \int t^5\,dt = t^6/6$ (remember, we don’t need a $+C$ here). Applying the integration by parts formula, we find

\[
\int t^5 \ln t\,dt = \int u\,dv = uv - \int v\,du = (\ln t)(t^6/6) - \int (t^6/6)(1/t)\,dt =
\]
\[
\frac{t^6 \ln t}{6} - \frac{1}{6}\int t^5\,dt = \frac{t^6 \ln t}{6} - \frac{t^6}{36} + C.
\]


Now, let’s look at $\int \ln t\,dt$. How could this possibly be an integration by parts problem? There’s no product in the integrand! Admittedly, this does not look like an integration by parts problem. Nonetheless, it is. Let $u = \ln t$, which means that $dv = dt$. Then, $du = (1/t)\,dt$, and $v = \int dv = \int dt = t$. Applying the integration by parts formula, we obtain

\[
\int \ln t\,dt = \int u\,dv = uv - \int v\,du = (\ln t)(t) - \int t\,(1/t)\,dt = t\ln t - \int 1\,dt = t\ln t - t + C.
\]
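Both answers in this example can be sanity-checked numerically. The sketch below is not from the text: the helper names `F1`, `F2`, `central_difference`, and `max_error`, the step size, and the sample points are all our own choices. It differentiates each candidate anti-derivative with a symmetric difference quotient and compares the result with the integrand.

```python
import math

def F1(t):
    # Candidate anti-derivative of t^5 * ln(t), with the constant C taken as 0.
    return t**6 * math.log(t) / 6 - t**6 / 36

def F2(t):
    # Candidate anti-derivative of ln(t), with the constant C taken as 0.
    return t * math.log(t) - t

def central_difference(f, t, h=1e-6):
    # Symmetric difference quotient approximating f'(t).
    return (f(t + h) - f(t - h)) / (2 * h)

def max_error(F, f, points):
    # Largest gap between the numerical derivative of F and the integrand f.
    return max(abs(central_difference(F, t) - f(t)) for t in points)

points = [0.5, 1.0, 2.0, 3.0]  # hypothetical sample points, all with t > 0
err1 = max_error(F1, lambda t: t**5 * math.log(t), points)
err2 = max_error(F2, math.log, points)
print(err1, err2)  # both should be tiny compared with the integrand values
```

A small error here is evidence, not proof; the differentiation by hand is still the real verification.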


We wish to look at one more integration by parts example, a complicated example.


Example 1.1.22. Calculate

\[
\int e^x \cos x\,dx.
\]

Solution:


We will calculate this integral via integration by parts. We could use either $u = e^x$ or $u = \cos x$; this time, it actually makes little difference. We will pick $u = e^x$, which means that $dv = \cos x\,dx$. (As an exercise, you should try starting with $u = \cos x$ instead.) We find, then, that $du = e^x\,dx$ and $v = \int dv = \int \cos x\,dx = \sin x$. The integration by parts formula tells us that we have

\[
\int e^x \cos x\,dx = \int u\,dv = uv - \int v\,du = e^x \sin x - \int (\sin x)e^x\,dx. \tag{1.2}
\]


But our new integral, $\int (\sin x)e^x\,dx$, is clearly just as difficult to integrate as the original integral. Are we getting anywhere? The answer is “yes”, but it’s not obvious yet. We will integrate $\int (\sin x)e^x\,dx$ by parts also.

You have to be careful. If you make the choice $u = \sin x$ here, you will obtain that $\int (\sin x)e^x\,dx$ equals $e^x \sin x - \int e^x \cos x\,dx$. If you substitute this into Formula 1.2, you will find that you have undone our original integration by parts, and you will come to the stunning conclusion that $\int e^x \cos x\,dx = \int e^x \cos x\,dx$. It’s true, but not very helpful.


When integrating $\int (\sin x)e^x\,dx$ by parts, you need to make the choice of $u$ that is analogous to your choice of $u$ for the original integration by parts; you let $u = e^x$ again, and so $dv = \sin x\,dx$. Then, $du = e^x\,dx$ and $v = \int dv = \int \sin x\,dx = -\cos x$. Applying integration by parts, we find

\[
\int (\sin x)e^x\,dx = \int u\,dv = uv - \int v\,du = e^x(-\cos x) - \int (-\cos x)e^x\,dx = -e^x\cos x + \int e^x \cos x\,dx.
\]


If you insert this result into Formula 1.2, you obtain

\[
\int e^x \cos x\,dx = e^x \sin x + e^x \cos x - \int e^x \cos x\,dx.
\]

At this point, you may be thinking to yourself “Aaaaagggghhhhhh! We spent all that time integrating by parts, only to end up with the same integral that we started with!” However, the fact that the new occurrence of the original integral has a negative sign in front of it saves us. If you simply add $\int e^x \cos x\,dx$ to each side of this last equation (and fix the loss of the arbitrary constant on the right), you obtain

\[
2\int e^x \cos x\,dx = e^x \sin x + e^x \cos x + C,
\]


and so

\[
\int e^x \cos x\,dx = \frac{e^x \sin x + e^x \cos x}{2} + C.
\]
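As a check on this result (our own working, not part of the original solution), differentiating with the product rule gives back the integrand:

```latex
\begin{align*}
\frac{d}{dx}\left(\frac{e^x \sin x + e^x \cos x}{2} + C\right)
  &= \frac{\left(e^x \sin x + e^x \cos x\right) + \left(e^x \cos x - e^x \sin x\right)}{2} \\
  &= e^x \cos x .
\end{align*}
```

The $e^x \sin x$ terms cancel, which mirrors the way the second integration by parts returned the original integral with a negative sign.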


We end this section with a possibly surprising complication that exists for anti-differentiation; 


a type of complication which does not occur for differentiation. 


Remark 1.1.23. From the derivative formulas in [2], we see that the derivative of any elementary function is again an elementary function. You might hope that anti-derivatives/integrals would behave equally well. They do not. It is easy to give elementary functions $f(x)$ for which it is possible to prove that there is no elementary function $F(x)$ such that $F'(x) = f(x)$, i.e., $f(x)$ has no elementary anti-derivative. Such functions $f(x)$ include $e^{x^2}$, $e^{-x^2}$, $\sin(x^2)$, and $\cos(x^2)$. This was first proved by Liouville in 1835.

The Fundamental Theorem of Calculus, Theorem 2.4.7, guarantees that the functions $e^{x^2}$, $e^{-x^2}$, $\sin(x^2)$, and $\cos(x^2)$, and, in fact, all continuous functions, have some anti-derivative, but those anti-derivatives need not be elementary functions.
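To make the remark concrete: even without an elementary formula, an anti-derivative of $e^{-x^2}$ can be tabulated numerically. The sketch below is our own illustration, not the book's. It uses composite Simpson's rule (the function names and the number of subintervals are arbitrary choices) and compares the result with the standard library's error function `math.erf`, which is itself defined through this integral rather than by an elementary formula.

```python
import math

def simpson(f, a, b, n=200):
    # Composite Simpson's rule with n subintervals (n must be even).
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def gaussian(t):
    return math.exp(-t * t)

def F(x):
    # Numerical anti-derivative of exp(-t^2), normalized so that F(0) = 0.
    return simpson(gaussian, 0.0, x)

for x in (0.5, 1.0, 2.0):
    # By the definition of erf, (sqrt(pi)/2) * erf(x) is the integral
    # of exp(-t^2) from 0 to x.
    print(x, F(x), math.sqrt(math.pi) / 2 * math.erf(x))
```

The two columns should agree to many decimal places, which illustrates the point of the remark: the anti-derivative exists and can be computed, it just cannot be written using elementary functions.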


1.1.1 Exercises 


Calculate the general anti-derivatives in Exercises 1 through 21. 


1. [ot +4049) de g 
2 / (2 — Te" + 6ya) dw 


. 3 
é | (ssint- 5) dt 
4 freee du 


y2 


ww 


On 
ae, oe 
ele 

+ 
<< 
iw) 

+ 
= 
Se 
Ka 


lon) 


) 
: les — dz 
7. J cost20 —1)d0 g 
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8. pete 
r 
9. Joo dr g 


x 1 


0. f(T wero) 3 
11, [oo 3)! de 


12. / (2cos(2t + 5) + 3sin(9t)) dt 


13. ic [(x + 2)**°] dx 


19. [ivere dt 


20. 


1 
x ax 


21. [— ae 
V—-2?+6xr-—8 


In each of Exercises 22 through 31, find the anti-derivative of the given function 
which satisfies the given initial condition. The anti-derivative of each function with 


a lower-case name is denoted by the upper-case version of the same letter. 


22. h(x) = 427 + 4a + 9, such that H(—1) = 2. 


5 
23. p(w) = a Te” + 6¥/w, such that P(—1) = 0. 


24. g(t) = 5sint — , such that Q(0) = 7. 


3 
V1-?? 
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1+ 
Oe) ea ee a che 
v 
bie += — eS 
a =>---+> 5 UL = . 
yy pti 
27. f(x) = 27+ zsinz, such that F(r) = 2r. 
2 
28. s(t) = ——,, such that S(e?) =5. 
8. s(t) H(iney2? ° at S(e*) =5 
29. g(x) =aVx+1, such that G(0) = 1. 
tan~* (4 
30. w(y) = a such that W(2) =. 
31. r(t) = te! — t, such that R(1) = V2. 


In each of Exercises 32 through 41, use integration by parts to find the indicated 


anti-derivative. 


32 


33. 


34. 


35. 


36. 


37. 


38. 


39. 


AO. 


Al. 


42. 


. | re®* dx 

f(x@—5)%e* dr 

J tsin(2t) dt 

J t? cost dt 

J Vp npdp 
at 

fe*sinadr 


fe?” sin(5z) dx 


f tan—! wdw 


$\int w\tan^{-1} w\,dw$. Hint: At some point, you may want to use that $w^2 = (1 + w^2) - 1$.


Suppose that the net force $F$, acting on an object of mass $m$, pushes the object along the $x$-axis with an acceleration function, in m/s², of

$a(t) = \sin(2t)$,

where $t \geq 0$ is measured in seconds.

a. Recall that $F = ma$ and that momentum $p = mv$. Find the momentum of the object, as a function of time, if the mass of the object is 10 kilograms, and momentum at time $t = 0$ is 20 kilogram meters per second.
b. What is the momentum of the object at time $t = 4$ seconds?
43. Repeat the preceding problem with the new acceleration function $a(t) = t\cdot\sin(2t)$.
44. For each positive integer $n$, define $f_n(\theta) = \sin^n\theta\cos\theta$ and $g_n(\theta) = \cos^n\theta\sin\theta$.


a. Find $\int f_n(\theta)\,d\theta$ and $\int g_n(\theta)\,d\theta$.


b. Find the specific anti-derivatives F,,(0) and G,,(@) that satisfy the initial conditions 
7 


F,,(0) =5 and Gy (5) Zi 


In Exercises 45 through 48, you are given the velocity of a particle at time t, and 


the position p(to) of the particle at a specific time to. Find the position function. 


45. u(t) = 3t? — 4t +3, p(1) = 4. 

46. v(t) = t+ cos(t), p(0) = 0. 

AT. v(t) = 2/18 + 7#?, p(1) = 8. 

48. v(t) = at + b, p(2) = 5. Leave your answer in terms of a and b. 


49. In the following steps, you will calculate the general anti-derivatives for $\sin^2 x$ and $\cos^2 x$.

a. Apply integration by parts to $\int \sin x\cdot\sin x\,dx$ (written suggestively).

b. Integration by parts yields a new anti-derivative. Use a trigonometric identity to write this new anti-derivative in terms of $\sin^2 x$, and solve your integration by parts equation for $\int \sin^2 x\,dx$.

c. What is $\int \cos^2 x\,dx$? Hint: Use your answer to part (b).

d. From the cosine double angle formula, $\cos(2x) = 2\cos^2 x - 1$. Use this to integrate $\cos^2 x$, and explain why the different-looking answer that you obtain is, in fact, the same as your answer from part (c).


Exercises 50 and 51 show that the argument in Exercise 49 can be generalized to calculate anti-derivatives of higher powers of sin and cos.

50. a. Use integration by parts to prove that

\[
\int \sin^n t\,dt = -\frac{1}{n}\cos t\,\sin^{n-1} t + \frac{n-1}{n}\int \sin^{n-2} t\,dt.
\]

Assume $n \geq 2$.

b. Use this formula to calculate $\int \sin^2 t\,dt$. Check your answer by comparing to the previous problem.


51. a. Use integration by parts to prove that

\[
\int \cos^n t\,dt = \frac{1}{n}\cos^{n-1} t\,\sin t + \frac{n-1}{n}\int \cos^{n-2} t\,dt.
\]

b. Use the formula in part (a) to determine $\int \cos^2 t\,dt$.


52. Suppose that instantaneous rate of change, with respect to time, of a population of an 


island at time t, measured in years, where t = 0 corresponds to the year 2000, is given by 


1 500 
2,/6,250,000+t (t+1)2° 


The population of the island in 2000 is 3000. 


a. Find an explicit formula for the population at time t. 


b. What is the predicted population in 2050? 


53. Prove that the argument used to calculate $\int \ln t\,dt$ can be generalized. Assume that $f(t)$ is differentiable and prove that

\[
\int f(t)\,dt = t f(t) - \int t f'(t)\,dt.
\]

54. Calculate $\int e^x \sinh x\,dx$. Hint: do not use integration by parts.
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55. 


56. 


57. 


58. 


59. 


60. 


61. 


62. 


63. 


64. 


65. 


Consider the following logic in calculating $\int e^x \sinh x\,dx$ using integration by parts.

\begin{align*}
\int e^x \sinh x\,dx &= e^x \sinh x - \int e^x \cosh x\,dx \\
 &= e^x \sinh x - \left(e^x \cosh x - \int e^x \sinh x\,dx\right) \;\Rightarrow \\
0 &= e^x \sinh x - e^x \cosh x \;\Rightarrow \\
e^x \sinh x &= e^x \cosh x.
\end{align*}

Since $e^x > 0$, we can divide and conclude that $\sinh x = \cosh x$. What is the flaw in this argument?


Prove the formula 


jee dt = te’ — Hf te dt. 


1 
Calculate / ue dz, where a # 0 is a constant. Hint: use Theorem 4.2.14 and an 
x+a 


Assume that n> 1. 


appropriate substitution. 


Prove that = sinh! (=) +C. 
a 


/ dx 
Vitae 


Calculate dx, where a > 0 is a constant. 


— 


1 


Calculate dx. Hint: consider sec” * x. 


laws |x a 
Calculate - e“e® dx. Hint: use substitution. g 


Let f,(x) = e® and, for all integers n > 2, let f,(x) = e/-1™. Prove that 


[aw i okt de— 2 Gy. 


Calculate “dx. Hint: rewrite 2” as an exponential expression with base e. 


Calculate 


7 


Calculate n(1+ x 


pe. 
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32 _ p2x 
66. Calculate / ae e 
e7= — 1 


Consider a simple electric circuit with an inductor, but no resistor or capacitor. 


A battery supplies voltage V(t). If inductance is constantly L, in henrys, then 
Kirchoff’s Law from Example 2.7.18 of tells us that the current i at time t 


d 
satisfies the differential equation Le = V(t). In Exercises 67 through 70, find an 


explicit formula for i(t), given the condition i(tg) = io. 


67. L = 12, V(t) =sint, i(0) =0. 


69. L=3, V(t) = 9,4, 1(0) =0 
mo £13 VG)= ue i(1) =2 


For each of the functions in Exercises 71 through 74, verify that (a) $\dfrac{d}{dx}\left[\int f(x)\,dx\right] = f(x)$ and (b) $\int f'(x)\,dx = f(x) + C$. Assume an appropriate domain for $f(x)$.


71. f(a) =2*. 


72. f(x) = 3cos(2z). 


73. f(z) =Ina. 

1 
4, = —~. 
te Ae) 142? 


75. In the following steps, you will find the general anti-derivatives for functions of the form $t^n \ln t$.

a. Suppose that $n \neq -1$. Apply integration by parts to find $\int t^n \ln t\,dt$.

b. Find $\int t^{-1} \ln t\,dt$.


76. Suppose that water flows out of a hole 0.1 square meters in area from the bottom of a 
cylindrical tank with a base radius of 2 meters and an initial height of 10 meters at a rate 


d 
_ = 0.007798t — 1.3999 cubic meters per second. 
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77. A particle is traveling along the curve y = x 


a. 


b. 


c. 


If the tank starts out full, what is the function V(t) for the volume in the tank at 


time t? 
Calculate the amount of water remaining in the tank one minute after the leak starts. 
Verify that uv = 0 precisely when the tank is empty. 


2. so that its z-coordinate is a function of 


time t, measured in minutes. Suppose that the horizontal velocity (i.e., velocity in only 


the horizontal direction) is given by dx/dt = 0.5 cos? t miles per minute. 


a. 


Cc. 


d. 


What is the maximum horizontal speed (absolute value of velocity) on the time in- 
terval 0 < t < 207? 


. Find the function x(t), the x-coordinate of the particle at time t, subject to the 


condition that the particle is at the origin at time ¢t = 0. 
Find the function y(t), the y-coordinate of the particle at time t. 


What is the vertical velocity of the particle at time t = 7 minutes? 


78. An enclosed room is built in order to experiment with the effects of pressure changes on 


objects. Suppose that the equipment is capable of decreasing the pressure in the room 


at arate of — 


2 


7a — 0.04t kilopascals, kPa, per second, and that it can run for up to one 


minute before overheating. 


a. 


If the internal pressure in the room is 101.325 kPa (equal to the standard atmospheric 


pressure or atm), find an expression for P(t). 


. How long will it take to reach 50 kPa (this is approximately the atmospheric pressure 


five and a half kilometers above the surface of the Earth)? 


. What is the lowest pressure that can be reached in the room? 


In Exercises 79 through 82, you are given the acceleration, $a = a(t)$ in m/s², of an object moving in a straight line, where $t$ is the time in seconds. Find the velocity $v$ and position $p$ of the object, as functions of time, in terms of the initial velocity $v_0$ and initial position $p_0$.


79. 


a=t+l1l 
a=sint+cost 
a=e 
a=te* 


1.2 Special Trigonometric Integrals 
and Trigonometric Substitutions 


In this section, we will discuss some special, important, trigonometric integrals, and then give 


three examples of integration by trigonometric substitution. 


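Proposition 1.2.1.

\[
\int \tan\theta\,d\theta = -\ln|\cos\theta| + C = \ln|\sec\theta| + C.
\]

Proof. We have

\[
\int \tan\theta\,d\theta = \int \frac{\sin\theta}{\cos\theta}\,d\theta.
\]

Let $u = \cos\theta$, so that $du = -\sin\theta\,d\theta$, i.e., $-du = \sin\theta\,d\theta$. We find

\[
\int \frac{\sin\theta}{\cos\theta}\,d\theta = \int -\frac{1}{u}\,du = -\ln|u| + C = -\ln|\cos\theta| + C.
\]

The second form follows since $-\ln|\cos\theta| = \ln|1/\cos\theta| = \ln|\sec\theta|$.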


Proposition 1.2.2.

\[
\int \sec\theta\,d\theta = \ln|\sec\theta + \tan\theta| + C.
\]

Proof. As $(\tan\theta)' = \sec^2\theta$ and $(\sec\theta)' = \sec\theta\tan\theta$, it follows that

\[
(\tan\theta + \sec\theta)' = \sec^2\theta + \sec\theta\tan\theta = (\sec\theta)(\tan\theta + \sec\theta).
\]

This means that, if $u = \tan\theta + \sec\theta$, then $du/d\theta = u\sec\theta$, i.e.,

\[
\int \sec\theta\,d\theta = \int \frac{1}{u}\,du = \ln|u| + C = \ln|\sec\theta + \tan\theta| + C.
\]


Proposition 1.2.3.

\[
\int \sec^3\theta\,d\theta = \frac{\sec\theta\tan\theta + \ln|\sec\theta + \tan\theta|}{2} + C.
\]


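Proof. This is a “fun” integration by parts problem.

\[
\int \sec^3\theta\,d\theta = \int \sec\theta\,(1 + \tan^2\theta)\,d\theta = \int \sec\theta\,d\theta + \int \sec\theta\tan^2\theta\,d\theta =
\]
\[
\ln|\sec\theta + \tan\theta| + \int (\tan\theta)(\sec\theta\tan\theta)\,d\theta.
\]

We approach this last integral by parts; let $u = \tan\theta$ and $dv = (\sec\theta\tan\theta)\,d\theta$. Then, $du = \sec^2\theta\,d\theta$ and $v = \int dv = \sec\theta$. We find

\[
\int (\tan\theta)(\sec\theta\tan\theta)\,d\theta = \int u\,dv = uv - \int v\,du = \tan\theta\sec\theta - \int \sec^3\theta\,d\theta.
\]

Combining this with our previous equality, we obtain

\[
\int \sec^3\theta\,d\theta = \sec\theta\tan\theta + \ln|\sec\theta + \tan\theta| - \int \sec^3\theta\,d\theta.
\]

Adding $\int \sec^3\theta\,d\theta$ to each side, fixing the missing $+C$, and dividing by 2 yields the desired result.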


Remark 1.2.4. You may wonder why we didn’t give integral formulas for the co-functions in 
the previous three propositions. The reason for this is that you should memorize fundamental 
integral formulas, and then use general techniques to quickly derive others, if possible. 


Since replacing $\theta$ by $\pi/2 - \theta$ changes any trig function into the corresponding co-function, the substitution $u = \pi/2 - \theta$ tells us that the integrals of the co-functions of those in Proposition 1.2.1, Proposition 1.2.2, and Proposition 1.2.3 are obtained by negating what we obtained and replacing all of the trig functions by their co-functions, i.e.,

\[
\int \cot\theta\,d\theta = \ln|\sin\theta| + C,
\]
\[
\int \csc\theta\,d\theta = -\ln|\csc\theta + \cot\theta| + C,
\]

and

\[
\int \csc^3\theta\,d\theta = -\,\frac{\csc\theta\cot\theta + \ln|\csc\theta + \cot\theta|}{2} + C.
\]

The following two iteration formulas for integrating powers of sine and cosine are frequently 


useful. They are proved by using integration by parts. 


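Proposition 1.2.5. If $n \geq 2$ is an integer, then

\[
\int \sin^n\theta\,d\theta = -\frac{1}{n}\sin^{n-1}\theta\cos\theta + \frac{n-1}{n}\int \sin^{n-2}\theta\,d\theta,
\]
\[
\int \cos^n\theta\,d\theta = \frac{1}{n}\cos^{n-1}\theta\sin\theta + \frac{n-1}{n}\int \cos^{n-2}\theta\,d\theta.
\]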
n n 


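Proof. These are obtained from integration by parts, and the two demonstrations are entirely similar. We will derive the $\int \cos^n\theta\,d\theta$ formula, and leave the other as an exercise.

We have

\[
\int \cos^n\theta\,d\theta = \int \cos^{n-1}\theta\cdot\cos\theta\,d\theta.
\]

Let $u = \cos^{n-1}\theta$ and $dv = \cos\theta\,d\theta$. Then, $du = -(n-1)\cos^{n-2}\theta\sin\theta\,d\theta$ and $v = \int dv = \sin\theta$. We find

\begin{align*}
\int \cos^n\theta\,d\theta = \int u\,dv &= uv - \int v\,du \\
 &= \cos^{n-1}\theta\sin\theta - \int \sin\theta\cdot\left(-(n-1)\cos^{n-2}\theta\sin\theta\right)d\theta \\
 &= \cos^{n-1}\theta\sin\theta + (n-1)\int \cos^{n-2}\theta\sin^2\theta\,d\theta \\
 &= \cos^{n-1}\theta\sin\theta + (n-1)\int (\cos^{n-2}\theta)(1 - \cos^2\theta)\,d\theta \\
 &= \cos^{n-1}\theta\sin\theta + (n-1)\int \cos^{n-2}\theta\,d\theta - (n-1)\int \cos^n\theta\,d\theta.
\end{align*}

Therefore, we have concluded that

\[
\int \cos^n\theta\,d\theta = \cos^{n-1}\theta\sin\theta + (n-1)\int \cos^{n-2}\theta\,d\theta - (n-1)\int \cos^n\theta\,d\theta.
\]

Adding $(n-1)\int \cos^n\theta\,d\theta$ to each side of the equation, dividing by $n$, and replacing the missing $+C$ yields the desired result.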
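The cosine iteration formula translates directly into a recursion. The sketch below is our own illustration: the function name `cos_power_integral` and the choice to work with the definite integral over $[0, b]$ (so that no $+C$ is needed) are assumptions, not part of the text. It applies the formula repeatedly until it reaches the base cases $n = 0$ and $n = 1$.

```python
import math

def cos_power_integral(n, b):
    # Integral of cos(theta)**n over [0, b] for an integer n >= 0,
    # computed with the cosine iteration formula.
    # The boundary term at theta = 0 vanishes because sin(0) = 0.
    if n == 0:
        return b
    if n == 1:
        return math.sin(b)
    boundary = math.cos(b) ** (n - 1) * math.sin(b) / n
    return boundary + (n - 1) / n * cos_power_integral(n - 2, b)

half_pi = math.pi / 2
print(cos_power_integral(2, half_pi))  # compare with pi/4
print(cos_power_integral(3, half_pi))  # compare with 2/3
print(cos_power_integral(4, half_pi))  # compare with 3*pi/16
```

Each call lowers the power by 2, so the recursion ends after about $n/2$ steps; the sine formula could be coded the same way.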


Integration by trigonometric substitution: 


Integration by trigonometric substitution (trig substitution) refers to having an integral involving some variable, say $x$, and “letting” $x$ equal an expression involving a trig function, e.g., $x = a\sin\theta$ or $x = a\tan\theta$, and then using properties of trigonometric functions to produce a manageable integral in terms of $\theta$. We placed the word “letting” in quotes in the previous sentence because we don’t really get to “let” $x$ be anything; it is what it is.

What we can do, however, is define a new variable $\theta$ in terms of $x$. Hence, what we really do are things like “let $\theta = \sin^{-1}(x/a)$” or “let $\theta = \tan^{-1}(x/a)$”, which actually imply more than $x = a\sin\theta$ and $x = a\tan\theta$, respectively; they also imply that the values of $\theta$ are restricted to the intervals $[-\pi/2, \pi/2]$ and $(-\pi/2, \pi/2)$, respectively.


Let’s look at three examples. 


Example 1.2.6. Evaluate the integral $\int \sqrt{a^2 - x^2}\,dx$, where $a > 0$ is a constant.


Solution: 


As the integrand is $\sqrt{a^2 - x^2}$, we must have that $-a \leq x \leq a$; it follows that $-1 \leq x/a \leq 1$. This is important since the closed interval $[-1, 1]$ is the domain of $\sin^{-1}$.

We now let $\theta = \sin^{-1}(x/a)$, so that $x = a\sin\theta$ and $-\pi/2 \leq \theta \leq \pi/2$. Then,

\[
dx = a\cos\theta\,d\theta,
\]

and

\[
\sqrt{a^2 - x^2} = \sqrt{a^2 - a^2\sin^2\theta} = a\sqrt{1 - \sin^2\theta} = a\sqrt{\cos^2\theta} = a\cos\theta,
\]

where we used that $\sqrt{a^2} = a$, since $a > 0$, and that $\sqrt{\cos^2\theta} = \cos\theta$, since $\cos\theta \geq 0$, because $-\pi/2 \leq \theta \leq \pi/2$.


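Therefore, we obtain:

\[
\int \sqrt{a^2 - x^2}\,dx = \int a\cos\theta\cdot a\cos\theta\,d\theta = a^2\int \cos^2\theta\,d\theta.
\]

Now, either by using the trig identity that $\cos^2\theta = (1 + \cos(2\theta))/2$, or by using the cosine iteration formula (which is proved using integration by parts) from Appendix B, we know that

\[
\int \cos^2\theta\,d\theta = \frac{1}{2}\left[(\sin\theta\cos\theta) + \theta\right] + C.
\]

Hence,

\[
\int \sqrt{a^2 - x^2}\,dx = \frac{a^2}{2}\left[(\sin\theta\cos\theta) + \theta\right] + C = \frac{1}{2}\left[(a\sin\theta\cdot a\cos\theta) + a^2\theta\right] + C =
\]
\[
\frac{1}{2}\left[x\sqrt{a^2 - x^2} + a^2\sin^{-1}\left(\frac{x}{a}\right)\right] + C.
\]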
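As a check on this answer (our own working, valid for $-a < x < a$, and using $\frac{d}{dx}\sin^{-1}(x/a) = 1/\sqrt{a^2 - x^2}$ for $a > 0$), differentiating the result recovers the integrand:

```latex
\begin{align*}
\frac{d}{dx}\,\frac{1}{2}\left[x\sqrt{a^2 - x^2} + a^2\sin^{-1}\left(\frac{x}{a}\right)\right]
  &= \frac{1}{2}\left[\sqrt{a^2 - x^2} - \frac{x^2}{\sqrt{a^2 - x^2}} + \frac{a^2}{\sqrt{a^2 - x^2}}\right] \\
  &= \frac{1}{2}\left[\sqrt{a^2 - x^2} + \frac{a^2 - x^2}{\sqrt{a^2 - x^2}}\right] \\
  &= \sqrt{a^2 - x^2} .
\end{align*}
```

The endpoints $x = \pm a$ are excluded in this check only because the intermediate fractions are undefined there.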
2 a 


Example 1.2.7. Evaluate the integral $\int \sqrt{a^2 + x^2}\,dx$, where $a > 0$ is a constant.


Solution: You might suspect that this integral would turn out to be something similar to the 
previous answer. After all, all that we changed was a minus sign to a plus sign. However, this 


seemingly simple change alters the problem dramatically. 


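Let $\theta = \tan^{-1}(x/a)$, so that $x = a\tan\theta$ and $-\pi/2 < \theta < \pi/2$. Then, $dx = a\sec^2\theta\,d\theta$, and

\[
\sqrt{a^2 + x^2} = \sqrt{a^2 + a^2\tan^2\theta} = a\sqrt{1 + \tan^2\theta} = a\sqrt{\sec^2\theta} = a\sec\theta,
\]

where $\sqrt{a^2} = a$, since $a > 0$, and $\sqrt{\sec^2\theta} = \sec\theta$, since $\sec\theta > 0$ because $-\pi/2 < \theta < \pi/2$. Thus, we find

\begin{align*}
\int \sqrt{a^2 + x^2}\,dx &= \int a\sec\theta\cdot a\sec^2\theta\,d\theta = a^2\int \sec^3\theta\,d\theta \\
 &= a^2\cdot\frac{\sec\theta\tan\theta + \ln|\sec\theta + \tan\theta|}{2} + C \\
 &= \frac{x\sqrt{a^2 + x^2} + a^2\ln\left|\dfrac{\sqrt{a^2 + x^2}}{a} + \dfrac{x}{a}\right|}{2} + C \\
 &= \frac{x\sqrt{a^2 + x^2} + a^2\ln\left|\sqrt{a^2 + x^2} + x\right|}{2} + C,
\end{align*}

where, in the last step, we used the property of the natural logarithm that $\ln(w/a) = \ln w - \ln a$, and absorbed the resulting constant $-a^2\ln a/2$ into the constant $C$.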


Example 1.2.8. Evaluate the integral

\[
\int \frac{1}{(a^2 + x^2)^n}\,dx,
\]

where $a > 0$ is a constant and $n \geq 1$ is an integer.


Solution: 


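Again, we let $\theta = \tan^{-1}(x/a)$, so that $x = a\tan\theta$ and $-\pi/2 < \theta < \pi/2$. Then, $dx = a\sec^2\theta\,d\theta$, and

\[
a^2 + x^2 = a^2 + a^2\tan^2\theta = a^2(1 + \tan^2\theta) = a^2\sec^2\theta.
\]

Thus,

\[
\int \frac{1}{(a^2 + x^2)^n}\,dx = \int \frac{1}{a^{2n}\sec^{2n}\theta}\cdot a\sec^2\theta\,d\theta = \frac{1}{a^{2n-1}}\int \frac{1}{\sec^{2n-2}\theta}\,d\theta = \frac{1}{a^{2n-1}}\int \cos^{2n-2}\theta\,d\theta.
\]

This final integral can be calculated using the cosine iteration formula in Proposition 1.2.5.

For instance, when $n = 1$, we recover a formula that we already knew:

\[
\int \frac{1}{a^2 + x^2}\,dx = \frac{1}{a}\int 1\,d\theta = \frac{1}{a}\,\theta + C = \frac{1}{a}\tan^{-1}\left(\frac{x}{a}\right) + C.
\]

When $n = 2$, we obtain

\[
\int \frac{1}{(a^2 + x^2)^2}\,dx = \frac{1}{a^3}\int \cos^2\theta\,d\theta = \frac{1}{a^3}\left(\frac{1}{2}\cos\theta\sin\theta + \frac{1}{2}\,\theta\right) + C =
\]
\[
\frac{1}{2a^3}\left(\frac{ax}{a^2 + x^2} + \tan^{-1}\left(\frac{x}{a}\right)\right) + C.
\]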
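The closed forms from the last two examples can be spot-checked numerically. The sketch below is our own; the function names, the value $a = 1.5$, and the sample points are arbitrary assumptions. It differentiates each candidate anti-derivative with a symmetric difference quotient and prints the gap from the corresponding integrand.

```python
import math

def central_difference(f, x, h=1e-6):
    # Symmetric difference quotient approximating f'(x).
    return (f(x + h) - f(x - h)) / (2 * h)

def F_sqrt(x, a):
    # Anti-derivative of sqrt(a^2 + x^2) from Example 1.2.7 (C = 0).
    r = math.sqrt(a * a + x * x)
    return (x * r + a * a * math.log(abs(r + x))) / 2

def F_n2(x, a):
    # Anti-derivative of 1/(a^2 + x^2)^2 from Example 1.2.8 with n = 2 (C = 0).
    return (a * x / (a * a + x * x) + math.atan(x / a)) / (2 * a**3)

a = 1.5  # arbitrary positive constant chosen for the check
for x in (-2.0, -0.3, 0.0, 1.0, 4.0):
    d1 = central_difference(lambda s: F_sqrt(s, a), x)
    d2 = central_difference(lambda s: F_n2(s, a), x)
    # Both printed gaps should be close to zero.
    print(x, d1 - math.sqrt(a * a + x * x), d2 - 1 / (a * a + x * x) ** 2)
```

Negative sample points are included on purpose: the tangent substitution places no restriction on the sign of $x$, so the formulas should hold on the whole real line.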


1.2.1 Exercises 


In each of Exercises 1 through 19, calculate the anti-derivative.


i / tan(30) d6. g 


2: y 2sec(5t) dt. 


3. [seo + tan(3y) dy. g 
A, [ pecot(e?) ae g 


5. [ios x) (cot(sinx)) da. 


> 


4 

sec*u—1 

/ at“ du. Hint: factor before anti-differentiating. 
secu — 1 


ie [esctla + ese? t) dt. 
8. [vE-# do. 


9. Jov 25 — ¢*d¢. Hint: do not use a trig substitution. Compare your answer to that of 


the previous problem. 


10. | V304 ay. g 
dk 
Pe lees 


dx 
12. ' 
/ v4 + 18a2x2 + 81 


13. ‘i Vv 10 — 49v? dv. 
14. / Vx? + 6x +21 dz. Hint: complete the square. 
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i) a 
9) eee 


16. [ve0? vay. 


17. [oot v dv. 


18. fev 16 — e2% dx. g 
19. i Vf e2% — 16 dz. 


20. Prove the sine iteration formula:

\[
\int \sin^n\theta\,d\theta = -\frac{1}{n}\sin^{n-1}\theta\cos\theta + \frac{n-1}{n}\int \sin^{n-2}\theta\,d\theta.
\]

Assume $n \geq 2$.


21. The reduction formula below gives a method for calculating integrals of powers of the tangent function. Verify this formula in the case $n = 2$.

\[
\int \tan^n\theta\,d\theta = \frac{1}{n-1}\tan^{n-1}\theta - \int \tan^{n-2}\theta\,d\theta.
\]


Use the angle addition identities to calculate anti-derivatives in the next three exercises. Assume in each exercise that $a$ and $b$ are positive integers. These integrals are extremely important in Fourier analysis.

22. $\int \sin(ax)\sin(bx)\,dx$.
23. $\int \sin(ay)\cos(by)\,dy$.
24. $\int \cos(aw)\cos(bw)\,dw$.
25. Explain why the results in the last three problems do not hold in the case $a = b$.


26. Assume that $n \geq 1$ and show that:

\[
\int x^n\sin x\,dx = -x^n\cos x + n\int x^{n-1}\cos x\,dx.
\]

27. Assume that $n \geq 1$ and show that:

\[
\int x^n\cos x\,dx = x^n\sin x - n\int x^{n-1}\sin x\,dx.
\]


Use the integral formulas from the previous problems to calculate the anti-derivatives 


in Exercises 28 - 38. 


28. fe sin x dz. 


29. Jv cosy dy. 

30. [wsinwau, 

31. [ Beoszde, g 
32. / sin® ddd. 

33. [sstvae, 

34. [rant tat, 

35. | tan* udu. 
36. / sin(92) sin(22) dx. 
37. 


cos(42) cos « da. 


38. 


es Fe ee, 


sin(3t) cos(5t) dt. 


Recall that the momentum, p, of a mass m moving with velocity v is given by p = mv. 
In Exercises 39 - 42 you are given the mass, acceleration, and the momentum at 


one specific time. Find the momentum function, p(t). The units of acceleration are 


kg-m 
meters per second per second. The units of momentum are : 
sec 


39. m = 12 kg, a(t) = 22/16 — #2, p(0) = 8. 
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40. m=9 kg, a(t) = ———— 


62 
Jag 


t 


41. m=5 kg, a(t) = 


42. m = 20 kg, a(t) 


(0) =0. 


~ Moore 


Integrate the following products of trig. functions. 


43. [sw’ x cos? x dx. 


44. / sin® wcos* w dw. g 


45. [ tanysec? y dy. 


46. [iex’ zsec? z dz. 


Find the general solution to the separable differential equation. 


dx V9— 1/44 2? 


dt er 


48 dy  3y/y? — 16 


dx Qx2/an? +4 


AT. 


dx 23 


49. = ‘ 
dz aVz2 +9 — 9x2 — 7222 


dy _ /2y — y? — Qx2y + xy? 


50. = a 


. Assume |x| < 1 and |y| < V2.


1.3 Integration by Partial Fractions


A rational function is one defined by the quotient of two polynomial functions, e.g., 


we —Te+ar x? —5at+1 
—______— and ——, 
r+5 x3 + x 


where the domains exclude roots of the denominators. This section describes the fundamental techniques for integrating rational functions. Integration by partial fractions refers to an algebraic technique for obtaining the partial fractions decomposition of a rational function, and then integrating the resulting, easier summands in the decomposition. The partial fractions decomposition is essentially what you get by “undoing” the work you have to do to add rational functions and write the result as a single rational function, i.e., you have to “undo” finding a (least) common denominator and simplifying the numerator after writing all of the fractions over the common denominator.


Example 1.3.1. For instance, if you want to write

\[
\frac{3}{x+7} + \frac{5}{x-2}
\]

as a rational function, that is, as the quotient of two polynomials, you would use the common denominator $(x+7)(x-2)$ and obtain

\[
\frac{3}{x+7} + \frac{5}{x-2} = \frac{3(x-2)}{(x+7)(x-2)} + \frac{5(x+7)}{(x+7)(x-2)} = \frac{3x - 6 + 5x + 35}{(x+7)(x-2)} = \frac{8x + 29}{x^2 + 5x - 14}.
\]


The corresponding partial fractions problem is to start with

\[
\frac{8x + 29}{x^2 + 5x - 14}
\]

and produce its partial fractions decomposition, i.e., to find that this rational function equals

\[
\frac{3}{x+7} + \frac{5}{x-2}.
\]

How does this help with integration? It means that, if we want to calculate the integral

\[
\int \frac{8x + 29}{x^2 + 5x - 14}\,dx,
\]

then, what we need to do is calculate

\[
\int \left(\frac{3}{x+7} + \frac{5}{x-2}\right)dx = 3\int \frac{1}{x+7}\,dx + 5\int \frac{1}{x-2}\,dx = 3\ln|x+7| + 5\ln|x-2| + C,
\]

where the final two integrals were accomplished via the easy substitutions $u = x + 7$ and $u = x - 2$, respectively.


So, how do you find partial fractions decompositions, and how do you integrate all of the 
resulting summands? Before we state the general process/technique, we shall first give a few 
examples of integration by partial fractions, and work through them slowly, in order to discuss 
most of the cases and issues that you will typically encounter. 


Example 1.3.2. Let’s look at the example above, in reverse. We know how to integrate the summands that we know will appear. The question is: how do you produce the partial fractions decomposition of

\[
\frac{8x + 29}{x^2 + 5x - 14}?
\]

(We are, of course, pretending that we don’t already know the answer.)


First, you note that the numerator has smaller degree than the denominator. If this were not the case, the first step would be to (long) divide the numerator by the denominator. We shall look at such a case in the next example.

The next step is either to show that the denominator factors into a product of two degree one (i.e., linear) real polynomials, or to complete the square to show that $x^2 + 5x - 14$ is an irreducible quadratic polynomial, that is, a quadratic polynomial that does not factor into two linear (real) polynomials. Hopefully, you see quickly (even if you have forgotten our work prior to this example) that $x^2 + 5x - 14$ factors as

\[
x^2 + 5x - 14 = (x+7)(x-2).
\]


However, if you did not notice this factorization, you would need to complete the square. Since the leading coefficient (the coefficient in front of $x^2$) is a 1, we complete the square by squaring half of the coefficient in front of the $x$ term, and then adding and subtracting the resulting quantity; in this example, this means we add and subtract $(5/2)^2 = 25/4$. What do we mean by “adding and subtracting 25/4?” Exactly what we wrote; it’s true that this adds zero in the end, but it adds zero in a very clever way. We obtain

\[
x^2 + 5x - 14 = x^2 + 5x + \frac{25}{4} - \frac{25}{4} - 14.
\]

The point is that $x^2 + 5x + \frac{25}{4}$ is a perfect square; it equals $\left(x + \frac{5}{2}\right)^2$. Thus, we find that

\[
x^2 + 5x - 14 = \left(x + \frac{5}{2}\right)^2 - \frac{25}{4} - 14 = \left(x + \frac{5}{2}\right)^2 - \frac{81}{4}. \tag{1.3}
\]

The fact that we end up with something squared minus a positive quantity means that this result can be factored (over the real numbers) as the difference of squares; hence, as we already knew, $x^2 + 5x - 14$ factors as

\[
x^2 + 5x - 14 = \left(x + \frac{5}{2}\right)^2 - \frac{81}{4} = \left(x + \frac{5}{2}\right)^2 - \left(\frac{9}{2}\right)^2 = \left(x + \frac{5}{2} + \frac{9}{2}\right)\left(x + \frac{5}{2} - \frac{9}{2}\right) = (x+7)(x-2).
\]


Had we completed the square and ended up with something squared plus a positive quantity in Formula 1.3, then the quadratic polynomial would have no real roots and, hence, would be an irreducible quadratic polynomial. We shall look at such an example later.


Discussing what to do in other cases is making this example much longer than it really is. All that we have really done to help us with our current problem is to factor the denominator, i.e., so far, what’s relevant to this example is that

\[
\frac{8x + 29}{x^2 + 5x - 14} = \frac{8x + 29}{(x+7)(x-2)}.
\]

We would like to “undo” the obtaining of the common denominator and write an equality of functions:

\[
\frac{8x + 29}{(x+7)(x-2)} = \frac{?}{x+7} + \frac{?}{x-2}, \tag{1.4}
\]

but what goes in place of the question marks?


It can be shown that there are unique polynomials that go where the question marks are and, because the numerator on the lefthand side has smaller degree than the denominator, the degrees of the numerators on the righthand side of Formula 1.4 must be less than the degrees of the denominators. However, the denominators have degree 1, and so the numerators must be polynomials of degree 0 (or the zero polynomial), i.e., the numerators must be constants.


Therefore, we know that there exist unique constants $A$ and $B$ such that, for all $x$ unequal to $-7$ and $2$, we have the equality

\[
\frac{8x + 29}{(x+7)(x-2)} = \frac{A}{x+7} + \frac{B}{x-2}, \tag{1.5}
\]

but how do we figure out what $A$ and $B$ are?

The first step is to “clear the denominators” by multiplying both sides of the equation by the big denominator on the left, and canceling factors. We obtain the equality

\[
8x + 29 = A(x-2) + B(x+7), \tag{1.6}
\]

where, initially, this equality is required to hold for all $x$, except $x = -7$ and $x = 2$. However, the polynomial functions on each side of the equality are defined and continuous everywhere; hence, if they are equal for all $x$, other than $-7$ and $2$, then they must, in fact, be equal when $x = -7$ and when $x = 2$.


It is important to emphasize that the equality in Formula 1.6 must now hold at those $x$ values which we couldn’t use earlier, those which made the original denominators in Formula 1.5 equal zero. The reason that this is important is because the easiest way to determine $A$ and $B$ is to plug $x = -7$ and $x = 2$ into Formula 1.6. We find:

When $x = -7$:

$8(-7) + 29 = A(-7-2) + B(0)$, and so $-27 = -9A$. Hence, $A = 3$.

When $x = 2$:

$8(2) + 29 = A(0) + B(2+7)$, and so $45 = 9B$. Hence, $B = 5$.


Thus, plugging $A = 3$ and $B = 5$ back into Formula 1.5, we finally obtain the partial fractions decomposition

\[
\frac{8x + 29}{x^2 + 5x - 14} = \frac{8x + 29}{(x+7)(x-2)} = \frac{3}{x+7} + \frac{5}{x-2}.
\]

If you now wish to integrate this, you proceed as we indicated before this example, by making easy substitutions, and you obtain

\[
\int \frac{8x + 29}{x^2 + 5x - 14}\,dx = 3\ln|x+7| + 5\ln|x-2| + C.
\]


We refer to our method above for determining A and B by plugging in exceptional values 
of x as the exceptional values method. There are at least two other methods for determining A 
and B, which are very similar to each other. 


One method is simply to substitute any two distinct $x$ values into Formula (1.6); this will
yield two linear equations involving $A$ and $B$. You solve these equations simultaneously, and
you will still find $A = 3$ and $B = 5$. (Try it.)


The other method is one that we refer to as matching coefficients. You “multiply out”
Formula (1.6), then collect the powers of $x$, and match the coefficients of the corresponding
powers of $x$ on each side of the equation, since two polynomial functions (in the variable $x$) are
equal if and only if the coefficients in front of each power of $x$ agree.




Following this method, we obtain

\[
8x + 29 = Ax - 2A + Bx + 7B = (A+B)x + (-2A+7B).
\]

The constant terms on each side of the equation must agree, and so must the coefficients in front
of $x$. Therefore, we find that $29 = -2A + 7B$ and $8 = A + B$. Solving these equations, we once
again find that $A = 3$ and $B = 5$.
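
The two approaches above can be checked with a few lines of exact arithmetic. The following is only a sketch, not part of the method itself: the helper names (`numerator`, `original`, `decomposition`) are made up for illustration, and Python's `fractions` module is used so that no rounding occurs.

```python
from fractions import Fraction

def numerator(x):
    """Numerator 8x + 29 of the rational function in this example."""
    return 8 * x + 29

# Exceptional values method: 8x + 29 = A(x - 2) + B(x + 7).
# At x = -7 the B term vanishes; at x = 2 the A term vanishes.
A = Fraction(numerator(-7), -7 - 2)
B = Fraction(numerator(2), 2 + 7)

# Matching coefficients: A + B = 8 and -2A + 7B = 29.
# Adding twice the first equation to the second gives 9B = 45.
B_match = Fraction(29 + 2 * 8, 9)
A_match = 8 - B_match

def original(x):
    """(8x + 29) / ((x + 7)(x - 2)), evaluated exactly."""
    x = Fraction(x)
    return numerator(x) / ((x + 7) * (x - 2))

def decomposition(x):
    """A/(x + 7) + B/(x - 2), evaluated exactly."""
    x = Fraction(x)
    return A / (x + 7) + B / (x - 2)

# Sample points chosen away from the excluded values x = -7 and x = 2.
samples = [0, 1, 3, Fraction(1, 2), -10]
agree = all(original(x) == decomposition(x) for x in samples)
```

Both methods give the same constants, and the two sides of the decomposition agree at every sample point, as they must.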


Example 1.3.3. Consider the integral

\[
\int \frac{x^3 + 2x^2 - 21x + 71}{x^2 + 5x - 14}\,dx.
\]


True, it’s disgusting-looking, but it’s really not so bad. We want to produce the partial fractions 


decomposition of the integrand, and integrate the resulting “pieces.” 


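This time, the degree of the numerator is greater than or equal to the degree of the denominator.
The first step is thus to long divide $x^3 + 2x^2 - 21x + 71$ by $x^2 + 5x - 14$.

\[
\begin{array}{r@{\;}r@{\;}r@{\;}r@{\;}r}
 & & & x & {}-3 \\ \cline{2-5}
x^2+5x-14 \;\big) & x^3 & {}+2x^2 & {}-21x & {}+71 \\
 & -x^3 & {}-5x^2 & {}+14x & \\ \cline{2-4}
 & & -3x^2 & {}-7x & {}+71 \\
 & & 3x^2 & {}+15x & {}-42 \\ \cline{3-5}
 & & & 8x & {}+29
\end{array}
\]

What does this division tell us? It tells us that there is an equality

\[
\frac{x^3 + 2x^2 - 21x + 71}{x^2 + 5x - 14} = x - 3 + \frac{8x+29}{x^2+5x-14}.
\]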


The remaining fractional part is precisely what we integrated in the last example. Hence, we
conclude that

\[
\int \frac{x^3 + 2x^2 - 21x + 71}{x^2 + 5x - 14}\,dx
= \int (x-3)\,dx + \int \frac{8x+29}{x^2+5x-14}\,dx
= \frac{x^2}{2} - 3x + 3\ln|x+7| + 5\ln|x-2| + C.
\]
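
The long division step is mechanical enough to sketch in code. The function below is an illustrative helper (the name `poly_divmod` and the coefficient-list convention, highest power first, are choices made here, not anything from the text); it repeats the "divide leading terms, subtract, bring down" loop of the hand computation.

```python
from fractions import Fraction

def poly_divmod(num, den):
    """Divide polynomials given as coefficient lists, highest power first.

    Returns (quotient, remainder) as coefficient lists.
    """
    num = [Fraction(c) for c in num]
    den = [Fraction(c) for c in den]
    quotient = []
    while len(num) >= len(den):
        factor = num[0] / den[0]      # divide the leading terms
        quotient.append(factor)
        for i, d in enumerate(den):   # subtract factor * den, aligned at the front
            num[i] -= factor * d
        num.pop(0)                    # the leading coefficient is now zero
    return quotient, num

# (x^3 + 2x^2 - 21x + 71) divided by (x^2 + 5x - 14)
q, r = poly_divmod([1, 2, -21, 71], [1, 5, -14])
```

Here `q` holds the coefficients of $x - 3$ and `r` those of $8x + 29$, matching the division above.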


Example 1.3.4. Consider the integral

\[
\int \frac{x+1}{(x-5)(x^2+6x+11)}\,dx.
\]


The numerator has smaller degree than the denominator, so there’s no need to long divide. We
either need to factor $x^2 + 6x + 11$, or complete the square. A few seconds of thought should
convince you that it doesn’t factor into linear terms with integer coefficients; still, it could factor
with real, but irrational coefficients, or be irreducible over the real numbers. Completing the
square will tell us, either way. We add and subtract $(6/2)^2 = 9$, and find

\[
x^2 + 6x + 11 = x^2 + 6x + 9 - 9 + 11 = (x+3)^2 + 2.
\]

This polynomial has no real roots, and so is irreducible (over the real numbers). What do we
do?


Now that we know that $x^2 + 6x + 11$ is irreducible, we temporarily put off using that it equals
$(x+3)^2 + 2$. What we want to do now is write an equality of functions

\[
\frac{x+1}{(x-5)(x^2+6x+11)} = \frac{?}{x-5} + \frac{?}{x^2+6x+11},
\]


but what should the question marks be this time? 


As before, it can be shown that there are unique polynomials that take the places of these 
question marks and, because the rational function on the left has a numerator of smaller degree 
than the denominator, the question marks are polynomials of smaller degree than the respective 
denominators. Thus, there exist unique constants A, B, and C such that 


\[
\frac{x+1}{(x-5)(x^2+6x+11)} = \frac{A}{x-5} + \frac{Bx+C}{x^2+6x+11}. \tag{1.7}
\]


We wish to determine A, B, and C. 


Again, we clear the denominators by multiplying both sides of the equation by the big
denominator on the left, and canceling. We obtain

\[
x + 1 = A(x^2+6x+11) + (Bx+C)(x-5),
\]

which now must hold for all $x$. Plugging in $x = 5$ immediately tells us that $6 = A\cdot 66 + (5B+C)\cdot 0$, so
that $A = 1/11$, but there are no easy (real) values to plug in for $x$ to immediately yield $B$ and
$C$. We will multiply things out and match coefficients.

We find

\[
x + 1 = Ax^2 + 6Ax + 11A + Bx^2 - 5Bx + Cx - 5C = (A+B)x^2 + (6A-5B+C)x + (11A-5C).
\]

Therefore, noting that 0 is the coefficient of $x^2$ in the polynomial $x + 1$, we obtain the three
simultaneous equations

\[
A + B = 0, \qquad 6A - 5B + C = 1, \qquad\text{and}\qquad 11A - 5C = 1.
\]

As we already know that $A = 1/11$, we really only need two of these equations, say the first
and the last, to find that $B = -1/11$ and $C = 0$. Substituting these values into Formula (1.7),
we obtain

\[
\frac{x+1}{(x-5)(x^2+6x+11)} = \frac{1}{11}\left(\frac{1}{x-5} - \frac{x}{x^2+6x+11}\right).
\]
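
Matching coefficients always ends in a system of linear equations, so it can be handed to any linear solver. As a rough sketch (the function `solve_linear` is a hypothetical helper written for this illustration, and it assumes the system has a unique solution), here is a small exact Gauss-Jordan elimination applied to the three equations above, with the unknowns ordered $(A, B, C)$.

```python
from fractions import Fraction

def solve_linear(matrix, rhs):
    """Gauss-Jordan elimination in exact arithmetic.

    Assumes the square system has a unique solution.
    """
    n = len(rhs)
    rows = [[Fraction(v) for v in row] + [Fraction(b)]
            for row, b in zip(matrix, rhs)]
    for col in range(n):
        # Find a row at or below the diagonal with a non-zero entry in this column.
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        # Scale the pivot row so that the pivot entry is 1.
        rows[col] = [v / rows[col][col] for v in rows[col]]
        # Clear this column in every other row.
        for r in range(n):
            if r != col:
                factor = rows[r][col]
                rows[r] = [v - factor * p for v, p in zip(rows[r], rows[col])]
    return [row[-1] for row in rows]

# Rows are the coefficients of x^2, x^1, x^0:
#   A + B = 0,   6A - 5B + C = 1,   11A - 5C = 1
A, B, C = solve_linear([[1, 1, 0], [6, -5, 1], [11, 0, -5]], [0, 1, 1])
```

The solver returns $A = 1/11$, $B = -1/11$, $C = 0$, in agreement with the hand computation.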


Before integrating, we wish to rewrite $x/(x^2 + 6x + 11)$ in a different form. Recall that we
completed the square to find that $x^2 + 6x + 11 = (x+3)^2 + 2$. Thus,

\[
\frac{x}{x^2+6x+11} = \frac{x}{(x+3)^2+2} = \frac{(x+3)-3}{(x+3)^2+2},
\]

where we added and subtracted a 3 in the numerator in order to have the same quantities in
parentheses in the numerator and denominator. We shall see momentarily why this is useful.

Putting together all of our work above, we have

\[
\frac{x+1}{(x-5)(x^2+6x+11)} = \frac{1}{11}\left[\frac{1}{x-5} - \frac{x+3}{(x+3)^2+2} + \frac{3}{(x+3)^2+2}\right].
\]


This, finally, is the partial fractions decomposition that we need. We now discuss how to
integrate the main “pieces” of the righthand side above, and will then put these pieces together
to obtain the final answer.

Certainly, the integral of $1/(x-5)$ is easy. By making the substitution $u = x - 5$, we find
quickly that

\[
\int \frac{1}{x-5}\,dx = \ln|x-5| + C_1.
\]

By making the substitution $w = (x+3)^2 + 2$, we find that $dw = 2(x+3)\,dx$, and so
$dw/2 = (x+3)\,dx$. Hence,

\[
\int \frac{x+3}{(x+3)^2+2}\,dx = \frac{1}{2}\int \frac{dw}{w} = \frac{1}{2}\ln|w| + C_2 = \frac{1}{2}\ln\big((x+3)^2+2\big) + C_2,
\]

where the absolute value signs disappeared since $(x+3)^2 + 2 > 0$.


If we let $v = x + 3$, then $dv = dx$, and we find

\[
\int \frac{1}{(x+3)^2+2}\,dx = \int \frac{1}{v^2+2}\,dv,
\]

which, by the second formula in Theorem 1.1.18, is equal to

\[
\frac{1}{\sqrt{2}}\tan^{-1}\left(\frac{v}{\sqrt{2}}\right) + C_3 = \frac{1}{\sqrt{2}}\tan^{-1}\left(\frac{x+3}{\sqrt{2}}\right) + C_3.
\]

Combining all of our work above, we, at long last, conclude that

\[
\int \frac{x+1}{(x-5)(x^2+6x+11)}\,dx =
\frac{1}{11}\left[\ln|x-5| - \frac{1}{2}\ln\big((x+3)^2+2\big) + \frac{3}{\sqrt{2}}\tan^{-1}\left(\frac{x+3}{\sqrt{2}}\right)\right] + C.
\]
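
With this many steps, it is worth checking the answer by differentiating it. The sketch below does so numerically rather than symbolically: it compares a central-difference approximation of the derivative of the anti-derivative with the integrand at a few points. The step size `h` and the sample points are arbitrary choices made for illustration.

```python
import math

def integrand(x):
    """(x + 1) / ((x - 5)(x^2 + 6x + 11))."""
    return (x + 1) / ((x - 5) * (x**2 + 6 * x + 11))

def antiderivative(x):
    """The anti-derivative found in Example 1.3.4, with C = 0."""
    return (1 / 11) * (
        math.log(abs(x - 5))
        - 0.5 * math.log((x + 3) ** 2 + 2)
        + (3 / math.sqrt(2)) * math.atan((x + 3) / math.sqrt(2))
    )

def central_difference(f, x, h=1e-5):
    """Approximate f'(x) by the symmetric difference quotient."""
    return (f(x + h) - f(x - h)) / (2 * h)

# Sample points on both sides of the excluded value x = 5.
errors = [abs(central_difference(antiderivative, x) - integrand(x))
          for x in (-4.0, 0.0, 2.0, 7.0)]
```

Every entry of `errors` should be tiny (limited only by the finite-difference approximation), which is strong evidence that no sign or constant was dropped along the way.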


Example 1.3.5. As our final example of integration by partial fractions, we will calculate the
integral

\[
\int \frac{1}{(x+2)(x-1)^3(x^2+4)}\,dx.
\]

Note that the denominator of the integrand has a repeated linear factor, $(x-1)^3$. This factor
is, of course, a polynomial of degree 3, but it is important that it is a repeated linear term. As
you would expect, we want to find the partial fractions decomposition:

\[
\frac{1}{(x+2)(x-1)^3(x^2+4)} = \frac{A}{x+2} + \frac{?}{(x-1)^3} + \frac{Bx+C}{x^2+4},
\]

where $x^2 + 4$ is clearly an irreducible quadratic polynomial, and the question mark should be
a general polynomial of degree 2, one degree less than the denominator. The slightly tricky
point is that, instead of writing the polynomial in the numerator of the $(x-1)^3$ term in powers
of $x$, it is best to write it in powers of $(x-1)$. Thus, we replace the question mark with
$D(x-1)^2 + E(x-1) + F$. This yields a summand

\[
\frac{D(x-1)^2 + E(x-1) + F}{(x-1)^3} = \frac{D}{x-1} + \frac{E}{(x-1)^2} + \frac{F}{(x-1)^3}.
\]


This leads to the general rule that, when looking for the partial fractions decomposition 
of a rational function, when there are repeated linear factors in the denominator, you have 
summands of the form a constant over each repeated power of the repeated linear factor, up to 
the power that appears in the original denominator. In a similar fashion, if we have repeated 
powers of an irreducible quadratic factor in the denominator, we have summands of the form a 
general linear polynomial over each repeated power of the repeated irreducible quadratic factor, 
up to the power that appears in the original denominator. 


Hence, we want to find constants $A$ through $F$ such that

\[
\frac{1}{(x+2)(x-1)^3(x^2+4)} = \frac{A}{x+2} + \frac{D}{x-1} + \frac{E}{(x-1)^2} + \frac{F}{(x-1)^3} + \frac{Bx+C}{x^2+4}. \tag{1.8}
\]


However, even before finding the constants, we can easily write the integral of this rational
function, with the unknown constants still to be determined. You should be able to show quickly
that

\[
\int \left[\frac{A}{x+2} + \frac{D}{x-1} + \frac{E}{(x-1)^2} + \frac{F}{(x-1)^3} + \frac{Bx+C}{x^2+4}\right] dx =
\]
\[
A\ln|x+2| + D\ln|x-1| - \frac{E}{x-1} - \frac{F}{2(x-1)^2} + \frac{B}{2}\ln(x^2+4) + \frac{C}{2}\tan^{-1}\left(\frac{x}{2}\right) + K, \tag{1.9}
\]

where $K$ is an arbitrary constant (which cannot be determined, unlike $A$ through $F$).


We need to find the constants $A$ through $F$ to finish the problem. You clear all the
denominators in Formula (1.8) by multiplying both sides by the big denominator on the left, and
obtain

\[
1 = A(x-1)^3(x^2+4) + D(x+2)(x-1)^2(x^2+4) + E(x+2)(x-1)(x^2+4) + F(x+2)(x^2+4) + (Bx+C)(x+2)(x-1)^3.
\]

What a mess! Still, $A$ and $F$ are easy to find; plug in $x = -2$ and $x = 1$. We find $1 =
A(-2-1)^3((-2)^2+4)$, so that $A = -1/216$, and $1 = F(1+2)(1^2+4)$, so that $F = 1/15$. To
find the remaining constants in the partial fractions decomposition, we leave it as an exercise (a
horrible exercise) for you to expand the terms, collect the powers of $x$, match the coefficients, and
solve the resulting simultaneous equations. Alternatively, many calculators, computer programs,
and the web site wolframalpha.com can produce partial fractions decompositions. What you
should find is that $B = -9/1000$, $C = 13/500$, $D = 46/3375$, and $E = -11/225$. Now, you
substitute the values of $A$ through $F$ into Formula (1.9) and you’re finished.
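
If you would rather not expand everything by hand, you can at least confirm the stated constants. The following sketch (with made-up helper names `lhs` and `rhs`) evaluates both sides of Formula (1.8) in exact rational arithmetic at several sample points; since the cleared-denominator identity is a polynomial identity of degree at most 5, agreement at six or more points is conclusive.

```python
from fractions import Fraction as Fr

# Constants as stated in Example 1.3.5.
A, B, C = Fr(-1, 216), Fr(-9, 1000), Fr(13, 500)
D, E, F = Fr(46, 3375), Fr(-11, 225), Fr(1, 15)

def lhs(x):
    """1 / ((x + 2)(x - 1)^3 (x^2 + 4)), evaluated exactly."""
    x = Fr(x)
    return 1 / ((x + 2) * (x - 1) ** 3 * (x**2 + 4))

def rhs(x):
    """The partial fractions decomposition from Formula (1.8)."""
    x = Fr(x)
    return (A / (x + 2)
            + D / (x - 1) + E / (x - 1) ** 2 + F / (x - 1) ** 3
            + (B * x + C) / (x**2 + 4))

# Any points other than x = -2 and x = 1 will do.
samples = [0, 2, -1, 3, -5, Fr(1, 2), Fr(7, 3)]
all_match = all(lhs(x) == rhs(x) for x in samples)
```

Because `Fraction` arithmetic is exact, `all_match` being true means the two sides agree exactly at those points, not merely to within rounding error.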


As we saw in the examples above, there are always two major steps in using partial fractions
to integrate a rational function $f(x)$:

• Find the partial fractions decomposition of $f(x)$.

• Integrate each summand in the partial fractions decomposition.


Of course, we have yet to state exactly what the partial fractions decomposition for an
arbitrary rational function is. We shall do so now.


Suppose $n(x)$ and $q(x)$ are (real) polynomial functions, and that the leading coefficient of
$q(x)$ is 1, i.e., $q(x)$ is not the zero function, and the coefficient of the largest power of $x$
appearing in $q(x)$, with a non-zero coefficient, is 1. (If the leading coefficient of $q(x)$ were some
other (non-zero) constant $c$, immediately factor out the $c$, follow the procedure given below, and
then, in the end, multiply by $1/c$.)


We want to define the partial fractions decomposition of the rational function $f(x) =
n(x)/q(x)$, whose domain is all $x$ such that $q(x) \neq 0$.


1. If the degree of $n(x)$ is greater than, or equal to, the degree of $q(x)$, then you first (long)
divide $q(x)$ into $n(x)$; if what you obtain from the division is a polynomial $p(x)$ and a
remainder of $r(x)$, then exactly what that means is that

\[
\frac{n(x)}{q(x)} = p(x) + \frac{r(x)}{q(x)},
\]

where $r(x) = 0$ or has degree less than the degree of $q(x)$. As it is easy to integrate
the polynomial portion $p(x)$, we will concentrate on the portion $r(x)/q(x)$. In addition, if
$r(x) = 0$, then there is nothing left to do, so we will assume that $r(x) \neq 0$.


So, at this point, we want to define and integrate the partial fractions decomposition of
$r(x)/q(x)$, where $r(x)$ and $q(x)$ are polynomials, $r(x) \neq 0$, $q(x) \neq 0$, the degree of $r(x)$ is
strictly less than the degree of $q(x)$, and the leading coefficient of $q(x)$ equals 1.


2. Factor $q(x)$ into, possibly repeated, linear terms, each of the form $(x-r)^m$, and, possibly
repeated, irreducible quadratic terms, which, after completing the square, can each be
written in the form $[(x-r)^2 + b^2]^m$.


3. $r(x)/q(x)$ is equal to a sum of contributions, where you have a contribution from each,
possibly repeated, irreducible factor of $q(x)$. For a factor of the form $(x-r)^m$, the
contribution to the partial fractions decomposition is a sum of the form

\[
\frac{A_1}{x-r} + \frac{A_2}{(x-r)^2} + \cdots + \frac{A_m}{(x-r)^m},
\]

where all of the $A_i$’s are constants (to be determined).


For a factor of the form $[(x-r)^2 + b^2]^m$, the contribution to the partial fractions
decomposition is a sum of the form

\[
\frac{B_1(x-r)+C_1}{(x-r)^2+b^2} + \frac{B_2(x-r)+C_2}{[(x-r)^2+b^2]^2} + \cdots + \frac{B_m(x-r)+C_m}{[(x-r)^2+b^2]^m},
\]

where all of the $B_i$’s and $C_i$’s are constants (to be determined).


4. Now, set $r(x)/q(x)$ equal to the sum of all of the contributions described above, and clear
the denominators by multiplying both sides of the equation by $q(x)$, and canceling factors.
The result is that the polynomial $r(x)$ equals a sum of polynomials with (possibly many)
unknown constants; these polynomial functions must be equal for all $x$.


5. Determine the unknown constants above by plugging in exceptional values (roots of $q(x)$)
for $x$, or by multiplying out the entire righthand side, gathering the powers of $x$ together,
and matching coefficients with the coefficients of $r(x)$. A combination of these two methods
may be easiest.


6. After determining all of the constants, the sum of the contributions described above, with
all of the values of the $A_i$’s, $B_i$’s, and $C_i$’s inserted, is the partial fractions decomposition
of $r(x)/q(x)$.


7. The individual summands in the partial fractions decomposition are integrated as follows.
We will omit the arbitrary constants produced by the indefinite integral.


The integrals with powers of a linear factor are easy:

\[
\int \frac{A}{x-r}\,dx = A\ln|x-r|,
\]

and if $m \geq 2$,

\[
\int \frac{A}{(x-r)^m}\,dx = \frac{A(x-r)^{-m+1}}{-m+1}.
\]

The integrals with powers of an irreducible quadratic term are harder:

\[
\int \frac{B(x-r)+C}{[(x-r)^2+b^2]^m}\,dx = \int \frac{B(x-r)}{[(x-r)^2+b^2]^m}\,dx + \int \frac{C}{[(x-r)^2+b^2]^m}\,dx.
\]

In the first integral, the substitution $u = (x-r)^2 + b^2$ yields $du = 2(x-r)\,dx$, so that we
have

\[
\int \frac{B(x-r)}{[(x-r)^2+b^2]^m}\,dx = \frac{B}{2}\int \frac{du}{u^m},
\]

which, if $m = 1$, is

\[
\frac{B}{2}\ln|u| = \frac{B}{2}\ln\big((x-r)^2+b^2\big)
\]

and, if $m \geq 2$, is

\[
\frac{B}{2}\cdot\frac{u^{-m+1}}{-m+1} = \frac{B}{2}\cdot\frac{\big((x-r)^2+b^2\big)^{-m+1}}{-m+1}.
\]

Finally, we need to integrate terms of the form

\[
\int \frac{C}{[(x-r)^2+b^2]^m}\,dx.
\]

After pulling the constant $C$ outside the integral, and making the trivial substitution
$u = x - r$, we are reduced to an integral of the form in Example 1.2.8; see that example
for the technique.


1.3.1 Exercises 


Use partial fractions to calculate the anti-derivatives in Exercises 1-25. 
lla + 26 
1. —+———. d 
/ a t+bat+6 g 


t+58 
2. free 
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8u — 
ae 
ae 49 dy 
llz-—2 
e: Jes yf 3-1" 
w? + 6w? — 4w +11 
6. d 
[- w? +3w—4 
pee wae yy 
7. 
pt+3 
34 492 
8. | 20 ~ 0-2 4 
ft +12074+ 512+ 74 
dx 
x? + 8a +15 
10. [- + 7s? Teri). 
s+7 
iL. Tm? +12m+4+8 an 
(m + 3)(m? + m+ 1) 
12. (S35 —4n+ 20 Ae 
nm — 2)(n? + 5) 


—3v? + 4u — 66 
13, d 
- (v — 6)(v? — dv + 18) 
3r 
Ce er 
i r44 5244 . 
/ 12¢? + 52t + 150 
15. 
(t? + 5t + 30)(2t + 3) 


3a? + 8a4+5 
16. d 
le +2n+5)(a+3) 


dt 


—97)2 — 
17. / ay — dy 
(y? + 4y + 20)(y + 12) 
1— 2w 


a / (w —1)(w2 +w+1) au 


_3¢? — 30 
of rerHera™ 


4s? +4546 
20. 
: lea" 


1.3. 
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i) 


6 


/ 5u? — 3u — 29 
: x du 
(u + 3)(u — 2)? 


—At? —10t — 14 
2, f ae 


(t+ 1)?(t — 1) 
/ —3x7 — 17x — 26 
(a + 3)?(x + 2) 
/ 277 + 187 + 43 
(j +5)8 
i Tr? — 2r? + Or — 22 
(r + 3)(r — 1)3 


no 


dj 


dr 


26. Prove that
\[
\int \frac{(A+C)x + AD + BC}{x^2 + (B+D)x + BD}\,dx = A\ln|x+B| + C\ln|x+D| + K,
\]
where $A$, $B$, $C$, $D$, and $K$ are constants and neither $A$ nor $C$ is zero.


27. Suppose $A$, $B$, $C$ and $D$ are non-zero constants and $B$ and $D$ are both positive. What is
\[
\int \frac{(A+C)x^2 + AD + BC}{x^4 + (B+D)x^2 + BD}\,dx\,?
\]


Solve the separable differential equations in Exercises 28 - 32. If an initial condition 


is g 


28. 


29. 


30. 


31. 


32. 


33. 


iven, solve for the integration constant. 


dy 7x? + 44x +80 
— 3 = 5 8. 
dx (e+ 5)*(a 2)’ ¥ )=5/ g 


dy _ 2x? — 3a? — 32a — 23 


dx x? — 227 —15 A 

dy — (6x° — 17a? — 274 + 16)(y? + 2y + 10)(y + 3) 
dx (x — 3)8(a + 5)(5y? + 16y — 10) 

dy (—ax — 2)(y* + 9y) 


dx = (3y? + 2y + 27) (a? + 11x 4+ 30)’ 


dy (30° — 23a? + 53x — 32)(y? — 3y + 10)(y +4) 


dx (x? — 6a + 8)(3y? + 8y + 22) 

Recall that the logistic model for population growth is given by

\[
\frac{dP}{dt} = kP(M-P),
\]

where $M$ and $k$ are non-zero constants and $P = P(t)$ is the population at time $t$. Suppose
$P_0 = P(0)$ is the initial population. Use the fact that this differential equation is separable
and integrate using partial fractions to arrive at the solution to the differential equation:

\[
P = \frac{M P_0}{(M - P_0)e^{-Mkt} + P_0}.
\]


Integrals involving square-roots can often be solved by a technique involving partial
fractions and a rationalizing substitution. The idea is to make a substitution that
allows the integrand to be written as a rational function, without a square-root
symbol, and then to integrate using partial fractions. The method is outlined in
Exercise 34. Use the method to calculate the anti-derivatives in Exercises 35-40.


34. Consider $\displaystyle\int \frac{dx}{x\sqrt{x+1}}$.

a. Let $u = \sqrt{x+1}$. Show that the integral can then be written as $\displaystyle\int \frac{2\,du}{u^2-1}$.

b. Use partial fractions to conclude that the integral in part (a) is $\ln|u-1| - \ln|u+1| + C$.

c. Rewrite the anti-derivative in terms of $x$, thus establishing:

\[
\int \frac{dx}{x\sqrt{x+1}} = \ln\left|\sqrt{1+x}-1\right| - \ln\left|\sqrt{1+x}+1\right| + C.
\]

35 ue 9 a
7 —9

36 dx 
, x—V/x+30 
dx 
37. VYe-—1 
38. 


| are 
39. lwo” 


x 
AO. — ——— . where a and D0 are positive constants. 

a—-Vix+b r 
A common application of partial fraction integration is the reaction rate of two chemicals that
combine to form a third chemical. This is called a bimolecular reaction and is justified by the
law of mass action.


Specifically, assume chemicals $A$ and $B$ have initial concentrations of $a$ and $b$ in mols per unit
volume and react to form chemical $C$. Then under certain conditions, the rate of increase in the
concentration, $c$, of chemical $C$ is given by:

\[
\frac{dc}{dt} = k(a-c)(b-c),
\]

where $k$ is some constant. Assume throughout the next four problems that $c < a < b$.


41. Find an explicit solution for the concentration of chemical C, c(t), ifa = 3,b=4,k =2 
and c(0) = 0. 


42. Find an explicit solution for the concentration of chemical C, c(t), if a= 8, b=10,k =1 
and c(0) = 0. 


43. What is the concentration of chemical C' after 15 seconds if a = 2, b = 5, k = 4.5 and 


c(0) = 0? Assume that t is measured in minutes. 


44. Assume that $a \neq b$ and prove that a general solution to the differential equation is

\[
c(t) = \frac{ab\left(1 - e^{(a-b)kt}\right)}{b - ae^{(a-b)kt}}.
\]

Assume that $c(0) = 0$.


The 18th century mathematician Leonhard Euler developed a quick way of determining
the coefficients of partial fraction decompositions of rational functions. Suppose
$P(x)/Q(x)$ is a rational function and assume $a + bx$ is a factor of $Q(x)$, but that $(a+bx)^2$
is not a factor of $Q(x)$. Then,
\[
\frac{P(x)}{Q(x)} = \frac{k}{a+bx} + R(x),
\]
and we’d like to determine $k$. Multiplying by $a + bx$ and using l’Hôpital’s Rule, Euler proved that

\[
k = \lim_{x \to -a/b} \frac{bP(x)}{Q'(x)}.
\]

Use this method to answer Exercises 45-49.


45. We considered the partial fraction decomposition of $\dfrac{x+1}{(x-5)(x^2+6x+11)}$ in Example 1.3.4.
We found the coefficient $A$ of the term $\dfrac{A}{x-5}$ to be $1/11$. Let’s use Euler’s method to
find this term.


a. Show that $a = -5$, $b = 1$ and that $Q'(x) = 3x^2 + 2x - 19$.
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A6. 


AT. 


48. 


A9. 


50. 


b. Show that based on Euler’s method, $A = \displaystyle\lim_{x \to 5} \frac{x+1}{3x^2 + 2x - 19}$.


c. Show that this limit is indeed $1/11$.


13 


What is the coefficient of x + 1 in the partial fraction decomposition of Pama 
x 


Determine $A$, $B$ and $C$ in the equation

\[
\frac{x^2+3x+5}{(x+2)(x+3)(x+5)} = \frac{A}{x+2} + \frac{B}{x+3} + \frac{C}{x+5}
\]

using Euler’s method. Note that $(x+2)(x+3)(x+5) = x^3 + 10x^2 + 31x + 30$.


303 + 4x7 +42 +5 
Use Euler’s method to find the partial fraction decomposition of aaa = = ae . Hint: 
ot — 
use Euler to find the A/(#—1) term. Subtract this term from the original function to find 


the remaining term. 


x™ + (2n — 1) 


Let m and n be positive integers. Let f(x) = ee “. What is the coefficient A 
-—2£ 


of the term A/(a# — 1) in the partial fraction decomposition? 


Consider $\displaystyle\int \frac{1}{x^4+1}\,dx$. According to Courant, “Even Leibnitz found this a troublesome
integration.”

a. Complete the square to show that $x^4 + 1 = (x^2+1)^2 - 2x^2$.


b. Notice that the expression in (a) is the difference of squares. Use this fact to prove
$x^4 + 1 = (x^2 + 1 + \sqrt{2}\,x)(x^2 + 1 - \sqrt{2}\,x)$.


c. Use the methods in this chapter to prove that

\[
\frac{1}{x^4+1} = K\left[\frac{x+\sqrt{2}}{x^2+\sqrt{2}\,x+1} - \frac{x-\sqrt{2}}{x^2-\sqrt{2}\,x+1}\right],
\]

where $K = \sqrt{2}/4$.

d. Finally, conclude that

\[
\int \frac{dx}{x^4+1} = K\left(\tfrac{1}{2}\ln\left|x^2+\sqrt{2}\,x+1\right| - \tfrac{1}{2}\ln\left|x^2-\sqrt{2}\,x+1\right| + \tan^{-1}(\sqrt{2}\,x+1) + \tan^{-1}(\sqrt{2}\,x-1)\right) + C.
\]


1.4 Integration using Hyperbolic Sine and Cosine


Corresponding to sine and cosine, we will define the hyperbolic sine and hyperbolic cosine
functions, $f(x) = \sinh x$ and $g(x) = \cosh x$, respectively; when speaking, hyperbolic cosine is usually
referred to by pronouncing cosh phonetically, while hyperbolic sine is usually referred to by
pronouncing sinh as though it were the word cinch.


Once we have hyperbolic sine and cosine, we could, but will not, define the remaining four 
hyperbolic trig. functions, in analogy with how the other trig. functions are defined in terms of 


sine and cosine, e.g., hyperbolic tangent is hyperbolic sine divided by hyperbolic cosine. 


Instead, we will concentrate on the properties of sinh and cosh which make them so useful 


for anti-differentiating certain types of functions. 


The definitions of hyperbolic sine and cosine do not look analogous in any way to the
definitions of the usual sine and cosine functions. Later, in Example 4.6.21, we will see that Euler’s
Formula provides a close relationship between the hyperbolic and standard trig. functions, but
you have to be willing to deal with complex numbers.


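Definition 1.4.1. The hyperbolic sine function, $\sinh : \mathbb{R} \to \mathbb{R}$, is defined by

\[
\sinh x = \frac{e^x - e^{-x}}{2},
\]

and the hyperbolic cosine function, $\cosh : \mathbb{R} \to \mathbb{R}$, is defined by

\[
\cosh x = \frac{e^x + e^{-x}}{2}.
\]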


The following algebraic/graphical/Calculus properties of sinh and cosh are easy to verify,
and we leave them as exercises.


Figure 1.1: The graph of $y = \sinh x$. Figure 1.2: The graph of $y = \cosh x$.


Proposition 1.4.2. Hyperbolic sine and cosine have the following properties:

1. $\sinh x$ is an odd function;

2. $\cosh x$ is an even function;

3. $\sinh x$ is one-to-one and its range is the entire real line;

4. $\cosh x$ is not one-to-one, but is one-to-one when restricted to the domain $[0, \infty)$, and
its range is the interval $[1, \infty)$;

5. $\sinh x$ is strictly increasing, is negative when $x$ is negative, and positive when $x$ is
positive;

6. $\cosh x$ is strictly decreasing on the interval $(-\infty, 0]$, strictly increasing on the interval
$[0, \infty)$, and attains its global minimum value of 1 when $x = 0$;

7. $\sinh' x = \cosh x$;

8. $\cosh' x = \sinh x$;

9. $1 + \sinh^2 x = \cosh^2 x$.


The last property above is where the term hyperbolic comes from; a point $(x, y) = (\cosh t, \sinh t)$
lies on the hyperbola given by $x^2 - y^2 = 1$. You should think of this as being analogous to the
fact that a point $(x, y) = (\cos t, \sin t)$ lies on the circle given by $x^2 + y^2 = 1$; hence, the “usual”
trigonometric functions are sometimes referred to as the circular trigonometric functions.
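
A quick numerical experiment can make these properties feel less abstract. The sketch below defines sinh and cosh directly from the exponential formulas (the function names shadow nothing in the standard library and are chosen only for this illustration), compares them with Python's built-in `math.sinh` and `math.cosh`, and checks that the points $(\cosh t, \sinh t)$ lie on the hyperbola $x^2 - y^2 = 1$.

```python
import math

def sinh(x):
    """Hyperbolic sine from its exponential definition."""
    return (math.exp(x) - math.exp(-x)) / 2

def cosh(x):
    """Hyperbolic cosine from its exponential definition."""
    return (math.exp(x) + math.exp(-x)) / 2

ts = [-2.0, -0.5, 0.0, 1.0, 3.0]

# The definitions agree with the library functions (up to rounding).
matches_library = all(
    math.isclose(sinh(t), math.sinh(t), rel_tol=1e-12, abs_tol=1e-12)
    and math.isclose(cosh(t), math.cosh(t), rel_tol=1e-12, abs_tol=1e-12)
    for t in ts
)

# Each point (cosh t, sinh t) satisfies x^2 - y^2 = 1.
on_hyperbola = all(
    math.isclose(cosh(t) ** 2 - sinh(t) ** 2, 1.0, rel_tol=1e-9)
    for t in ts
)

# sinh is odd and cosh is even.
parity_ok = all(
    math.isclose(sinh(-t), -sinh(t), abs_tol=1e-12)
    and math.isclose(cosh(-t), cosh(t), abs_tol=1e-12)
    for t in ts
)
```

This is, of course, a spot check at a handful of points and not a proof; the proofs are the exercises mentioned above.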


As hyperbolic sine is one-to-one, onto, and possesses an everywhere non-zero derivative,
the inverse function $\sinh^{-1}$ exists and is differentiable everywhere. However, as cosh is not
one-to-one, we restrict its domain to $[0, \infty)$ before producing an inverse function. We can find
explicit formulas for these inverse functions by writing $x = (e^y - e^{-y})/2$ and $x = (e^y + e^{-y})/2$,
respectively, and solving for $y$ in each case, while paying attention to the restrictions on the
domain and codomain of $\cosh^{-1}$. The algebra involved is simply the quadratic formula, and
applying the natural logarithm, but the “variable” to which you apply the quadratic formula is
$e^y$; we leave this algebra as an exercise also.


You should find the following formulas for inverse hyperbolic sine and inverse hyperbolic
cosine.


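Proposition 1.4.3. The function $\sinh^{-1} : \mathbb{R} \to \mathbb{R}$ is given by

\[
\sinh^{-1} x = \ln\left(x + \sqrt{x^2+1}\right),
\]

and the function $\cosh^{-1} : [1, \infty) \to [0, \infty)$ is given by

\[
\cosh^{-1} x = \ln\left(x + \sqrt{x^2-1}\right).
\]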
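
As a sanity check on these formulas, the following sketch compares them with the inverse hyperbolic functions that Python's `math` module already provides (`math.asinh` and `math.acosh`). The function names `inverse_sinh` and `inverse_cosh` are ours, introduced only for this comparison.

```python
import math

def inverse_sinh(x):
    """ln(x + sqrt(x^2 + 1)), defined for every real x."""
    return math.log(x + math.sqrt(x * x + 1))

def inverse_cosh(x):
    """ln(x + sqrt(x^2 - 1)), defined for x >= 1."""
    if x < 1:
        raise ValueError("inverse_cosh is only defined for x >= 1")
    return math.log(x + math.sqrt(x * x - 1))

sinh_agrees = all(
    math.isclose(inverse_sinh(x), math.asinh(x), rel_tol=1e-9, abs_tol=1e-12)
    for x in (-3.0, -0.5, 0.0, 2.0, 10.0)
)
cosh_agrees = all(
    math.isclose(inverse_cosh(x), math.acosh(x), rel_tol=1e-9, abs_tol=1e-12)
    for x in (1.0, 1.5, 4.0, 25.0)
)

# Round trip: applying sinh to inverse_sinh(2) should return 2.
round_trip = math.sinh(inverse_sinh(2.0))
```

Note that `inverse_cosh` always returns a non-negative number, which reflects the choice of $[0, \infty)$ as the restricted domain of cosh.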


By using either the algebraic formulas above, or the general formula for the derivative of an
inverse function, we find (via another exercise for you):


Proposition 1.4.4. The function $\sinh^{-1} : \mathbb{R} \to \mathbb{R}$ is differentiable (everywhere), and

\[
(\sinh^{-1} x)' = \frac{1}{\sqrt{x^2+1}}.
\]

The function $\cosh^{-1} : [1, \infty) \to [0, \infty)$ is continuous; for all $x > 1$, $\cosh^{-1}$ is differentiable,
and

\[
(\cosh^{-1} x)' = \frac{1}{\sqrt{x^2-1}}.
\]


We could immediately rewrite the formulas above as anti-derivative formulas, but we want
to emphasize that the real value of hyperbolic sine and cosine when anti-differentiating stems
from three properties: that $1 + \sinh^2 x = \cosh^2 x$, $\sinh' x = \cosh x$, and $\cosh' x = \sinh x$.

Example 1.4.5. Suppose that you don’t remember the derivatives of inverse hyperbolic sine
and cosine that we gave above, but you need to find the anti-derivative

\[
\int \frac{1}{\sqrt{x^2+a^2}}\,dx,
\]

where $a > 0$ is a constant.


When you see $\sqrt{x^2+a^2}$, you should realize that there are (at least) two reasonable choices
for methods of attack; these rely on your knowing that $1 + \tan^2\theta = \sec^2\theta$ and that
$1 + \sinh^2 t = \cosh^2 t$.


We will first use $\tan\theta$ and see that the calculation takes a while. We will then look at how
much easier the integral is using $\sinh t$.


Using circular trig. functions:


Using tangent, you should think “I want $x$ to be $a\tan\theta$, so that $x^2 + a^2 = a^2\tan^2\theta + a^2 =
a^2(\tan^2\theta + 1) = a^2\sec^2\theta$”, and then $\sqrt{x^2+a^2} = a\sec\theta$. That’s how you think about the
problem, but, being more careful, what you really do is not “let $x = a\tan\theta$”; after all, $x$ is
already defined in the problem. You define the new variable $\theta$ by letting $\theta = \tan^{-1}(x/a)$, so
that, yes, $x = a\tan\theta$, but now we also know that $-\pi/2 < \theta < \pi/2$. This restriction on $\theta$ is
important, because what we really get is that

\[
\sqrt{x^2+a^2} = \sqrt{a^2\sec^2\theta} = |a\sec\theta|.
\]

To know that this last quantity is equal to $a\sec\theta$, you have to use that $a > 0$ and that, since
$-\pi/2 < \theta < \pi/2$, $\sec\theta > 0$.


Therefore, since $x = a\tan\theta$, $dx = a\sec^2\theta\,d\theta$ and

\[
\int \frac{1}{\sqrt{x^2+a^2}}\,dx = \int \frac{1}{a\sec\theta}\,a\sec^2\theta\,d\theta = \int \sec\theta\,d\theta =
\ln|\sec\theta + \tan\theta| + C = \ln(\sec\theta + \tan\theta) + C,
\]

where, to eliminate the absolute value signs, we used that our restrictions on $\theta$ imply that
$\sec\theta + \tan\theta > 0$. To finish the problem, you now use that $\tan\theta = x/a$ and that $\sec\theta =
\sqrt{x^2+a^2}/a$.

Our final result:

\[
\int \frac{1}{\sqrt{x^2+a^2}}\,dx = \ln\left(\frac{x + \sqrt{x^2+a^2}}{a}\right) + C = \ln\left(x + \sqrt{x^2+a^2}\right) + C,
\]

where, in the last step, we used that $\ln\left[(x + \sqrt{x^2+a^2})/a\right] = \ln\left(x + \sqrt{x^2+a^2}\right) - \ln a$, and then
we “absorbed” the $-\ln a$ into the constant $C$.


Using hyperbolic trig. functions:


Using hyperbolic sine, you should think “I want $x$ to be $a\sinh t$, so that $x^2 + a^2 = a^2\sinh^2 t +
a^2 = a^2(\sinh^2 t + 1) = a^2\cosh^2 t$”, and then $\sqrt{x^2+a^2} = a\cosh t$. Again, that’s how you think
about the problem, but what you really do is not “let $x = a\sinh t$”; you define the new variable
$t$ by letting $t = \sinh^{-1}(x/a)$. However, we don’t need any special restrictions on $t$ to know that
$\sqrt{a^2\cosh^2 t} = a\cosh t$, because $\cosh t \geq 1 > 0$.


As $x = a\sinh t$, $dx = a\cosh t\,dt$, and we find

\[
\int \frac{1}{\sqrt{x^2+a^2}}\,dx = \int \frac{1}{a\cosh t}\,a\cosh t\,dt = \int dt = t + C = \sinh^{-1}\left(\frac{x}{a}\right) + C,
\]

which is the nicest form in which to leave the answer. On the other hand, if you wish to see
that this answer agrees with the answer that we obtained above, using tangent, then you use
the formula for $\sinh^{-1}$ in Proposition 1.4.3.
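
The agreement between the two answers can also be seen numerically. In the sketch below (function names and the sample value $a = 3$ are arbitrary choices for illustration), `via_tangent` is the result of the tangent substitution before the $-\ln a$ is absorbed into $C$, and `via_sinh` is the result of the hyperbolic substitution.

```python
import math

def via_tangent(x, a):
    """ln((x + sqrt(x^2 + a^2)) / a): the tangent-substitution answer, C = 0."""
    return math.log((x + math.sqrt(x * x + a * a)) / a)

def via_sinh(x, a):
    """sinh^{-1}(x / a): the hyperbolic-substitution answer, C = 0."""
    return math.asinh(x / a)

a = 3.0
same_function = all(
    math.isclose(via_tangent(x, a), via_sinh(x, a), rel_tol=1e-9, abs_tol=1e-12)
    for x in (-2.0, 0.0, 0.5, 4.0)
)

# After absorbing -ln a, the tangent answer differs from sinh^{-1}(x/a)
# by the constant ln a, which is harmless for an anti-derivative.
offset = math.log(4.0 + math.sqrt(4.0**2 + a * a)) - via_sinh(4.0, a)
```

Here `offset` equals $\ln a$, which is exactly the constant that was absorbed into $C$.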


Example 1.4.6. Suppose that $a > 0$, and that we want the general anti-derivative of the
function

\[
f(x) = \frac{1}{\sqrt{x^2-a^2}}
\]

with domain $x > a$, i.e., we want to find

\[
\int \frac{1}{\sqrt{x^2-a^2}}\,dx,
\]

where $x > a > 0$.


We want $x = a\cosh t$, so that

\[
\sqrt{x^2-a^2} = \sqrt{a^2\cosh^2 t - a^2} = a\sqrt{\sinh^2 t} = a|\sinh t|.
\]

Thus, we let $t = \cosh^{-1}(x/a)$, which implies both that $x = a\cosh t$ and, as $x/a > 1$, that $t > 0$.
Since $t > 0$, $|\sinh t| = \sinh t$.


Therefore, $dx = a\sinh t\,dt$ and

\[
\int \frac{1}{\sqrt{x^2-a^2}}\,dx = \int \frac{1}{a\sinh t}\,a\sinh t\,dt = \int dt = t + C = \cosh^{-1}\left(\frac{x}{a}\right) + C.
\]

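The results from the last two examples are worth recording in a proposition.

Proposition 1.4.7. Suppose that $a > 0$. Then,

\[
\int \frac{1}{\sqrt{x^2+a^2}}\,dx = \sinh^{-1}\left(\frac{x}{a}\right) + C,
\]

and, for $x > a$,

\[
\int \frac{1}{\sqrt{x^2-a^2}}\,dx = \cosh^{-1}\left(\frac{x}{a}\right) + C.
\]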
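
Both formulas in the proposition can be spot-checked by differentiating the righthand sides numerically. This is a rough sketch only: the value $a = 2$, the sample points, and the step size in `central_difference` are arbitrary, and a finite-difference check is evidence, not a proof.

```python
import math

def central_difference(f, x, h=1e-5):
    """Approximate f'(x) by the symmetric difference quotient."""
    return (f(x + h) - f(x - h)) / (2 * h)

a = 2.0

def F_sinh(x):
    """sinh^{-1}(x / a), claimed anti-derivative of 1 / sqrt(x^2 + a^2)."""
    return math.asinh(x / a)

def F_cosh(x):
    """cosh^{-1}(x / a), claimed anti-derivative of 1 / sqrt(x^2 - a^2) for x > a."""
    return math.acosh(x / a)

sinh_errors = [
    abs(central_difference(F_sinh, x) - 1 / math.sqrt(x * x + a * a))
    for x in (-3.0, 0.0, 1.5)
]
cosh_errors = [
    abs(central_difference(F_cosh, x) - 1 / math.sqrt(x * x - a * a))
    for x in (2.5, 4.0, 10.0)   # all sample points satisfy x > a
]
```

All of the recorded errors should be very small, consistent with the derivative formulas in Proposition 1.4.4 and the chain rule.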


1.4.1 Exercises 


Calculate the anti-derivatives of the functions. 
7 
i. ———- dz. 
Fe 
12 
2. ———_ dz. 
| V 22 — 625 g 
d 
Ey i. aes 
V/ 4y? + 49 
A / dt 
“4b af? —81_ 


5. J cosn(3) dx. g 
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6. / ssinh sds. 


7 } at A 
. —— ax. 
Vat+9 


8. $\displaystyle\int \left(\cosh^2 x - \sinh^2 x\right) dx$. Hint: why is this trivial?


9 } dx 
Vda? + 4x +9 


10 / gy 
Jy? — 10y+9 
1+VB 

it. BEE i 

Vt? +3 
Dees A Age 2: 
12. 1 4+ vu Ea: 
Vut — 16 
Dai i 2 
13. ve 4+ va ees 
xt + 5x2 — 36 
2 
U 
a as, 77 
ae ° 


15.

a. Calculate $\displaystyle\int \sinh x \cosh x\,dx$ using integration by parts.


b. Calculate $\displaystyle\int \sinh x \cosh x\,dx$ directly by writing $\sinh x$ and $\cosh x$ in terms of their
definitions using the exponential function.


16. Calculate [Veiae. 


In each of Exercises 17 through 25, prove the given statements from Proposition 1.4.2.


17. $\sinh x$ is an odd function.

18. $\cosh x$ is an even function.

19. $\sinh x$ is one-to-one and its range is the entire real line.

20. $\cosh x$ is not one-to-one, but is one-to-one when restricted to the domain $[0, \infty)$, and its
range is the interval $[1, \infty)$.

21. $\sinh x$ is strictly increasing, is negative when $x$ is negative, and positive when $x$ is positive.

22. $\cosh x$ is strictly decreasing on the interval $(-\infty, 0]$, strictly increasing on the interval
$[0, \infty)$, and attains its global minimum value of 1 when $x = 0$.

23. $\sinh' x = \cosh x$.

24. $\cosh' x = \sinh x$.

25. $1 + \sinh^2 x = \cosh^2 x$.


26. What is / V2? —1dx? g 

27. What is ji 2/42 — 1 dx? 

28. What is y a?/ a? + 1dx? 

29. More generally, what is / a? y/ a2 + a2 dx? 


Use the previous four problems to calculate the integrals in the next four problems. 


30. [Vor ae. 

31. 250? V 250? 7 ae. 

32, / V8 + 10084 de. 

33. cx? \/b2x2 + a2 da, b>0,a>0. 


In Exercises 34 through 37, you are given the acceleration function of a particle and
the velocity at one specific time. Find the velocity function.


1 
————we | 
vt? + 16t + 25 

1 


35. a t) = Ve—6t+ 16’ v(6) = 8. 


12 
36. a(t) = ———_—_, 03) = 6. 
) V6t? + 24 (8) g 
37. a(t) = 2tcosh(¢? + 2), v(0) = 4. 


34. a(t) = (0) = 10. Assume that t > —8 + V39. 
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38. What is / sinh~'xdx? Hint: use a technique similar to the one used in calculating 


[osear. 


39. What is / cosh! x dx? 


In the next four exercises you are given the acceleration function of a particle. Find 
a formula for the position vector. Make sure to retain any integration constants. 


Use the previous two problems. 


AO. a(t) = am. 
41. a(t) = — 
42. a(t) = are, 0(0) =0 
43. a(t) = = —, v(2) =8. 


Solve the following separable differential equations. 


aa dz 12x? —8 
“dt = 44602 


ae dx (x? +1)3/? 
a ea 


t(@+1) _ et(e-1) 


46. dt ev(ttl) 4 ex(t=1) 


re dx (ote tee 


dt JV t4—¢? — 2? 4 ta?" 
denominator. 


Hint: look for a factor common to the numerator and 


Calculate the integrals of the following products of hyperbolic and trigonometric
functions. You may find it easier to write the hyperbolic functions in terms of
exponential functions.


48. $\displaystyle\int \sinh x \cos x\,dx$.


49. $\displaystyle\int \cosh x \cos x\,dx$.


50. $\displaystyle\int \cosh x \sinh x \sin x \cos x\,dx$.


Chapter 2 


Continuous sums: the Definite 
Integral 


As we wrote at the beginning of the previous chapter, the definite integral is the mathematically
precise notion of what it means to “take a continuous sum of infinitesimal contributions.” It
is the definition of integration as a continuous sum that yields all of its applications, many of
which we will look at in Chapter 3. However, the basic method of calculating definite integrals
is to use the Fundamental Theorem of Calculus, Theorem 2.4.10, to conclude that the problem
of calculating an integral (typically) boils down to producing an anti-derivative. Thus, as we
wrote in the previous chapter, anti-differentiation is also referred to as calculating an indefinite
integral.


We begin this chapter with a discussion of basic properties, notations, and techniques involv- 
ing summations; not surprisingly, differences are involved in crucial ways. We then move on to 
the problem of how to approximate a “continuous sum”; of course, part of the problem is that 
we don’t (yet) have a mathematically rigorous definition of what a continuous sum is. After 
we have our approximations, we then use limits of our approximations to define the definite 


integral. 
2.1 Sums and Differences 


In this section, we need to recall some notation and properties related to summations. We shall 


also define the difference operator, and use it to derive some useful summation formulas. 


Recall the sigma notation for summations. 


Definition 2.1.1. Suppose that we have two integers $m$ and $n$, where $m \le n$. We let $[m..n]$
denote the set of integers between, or equal to, $m$ and $n$. We call such a set an integer
interval.

Suppose we have a function $f$, whose domain includes $[m..n]$. Then, we write $\sum_{k=m}^{n} f(k)$ for
the summation, as $k$ goes from $m$ to $n$, of $f(k)$. This means that

$$\sum_{k=m}^{n} f(k) = f(m) + f(m+1) + \cdots + f(n-1) + f(n).$$

In this context $k$ is frequently referred to as the index of summation, and $f(k)$ is frequently
written as $f_k$. The integer interval $[m..n]$ is called the range of the index of summation.


For example, 


$$\sum_{k=-1}^{3} k^2 = (-1)^2 + 0^2 + 1^2 + 2^2 + 3^2 = 15,$$

or, as another example,

$$p(x) = a_0 + a_1x + \cdots + a_nx^n = \sum_{k=0}^{n} a_kx^k,$$

where $x^0$ in the summation is to be interpreted as equaling 1, even if $x = 0$.


Note that the indexing variable $k$ is a dummy variable; if we replaced it with an $i$ or $j$, or
any other variable (which is not already present), the summation would not change, e.g.,

$$\sum_{j=-1}^{3} j^2 = (-1)^2 + 0^2 + 1^2 + 2^2 + 3^2 = 15.$$


Summations can be “split”. For instance, in the example above,

$$\sum_{j=-1}^{3} j^2 = (-1)^2 + 0^2 + 1^2 + 2^2 + 3^2 = \left[(-1)^2 + 0^2\right] + \left[1^2 + 2^2 + 3^2\right] = \sum_{j=-1}^{0} j^2 + \sum_{j=1}^{3} j^2.$$


In general, we have 


Proposition 2.1.2. (splitting summations) If $m$, $n$, and $p$ are integers, with $m \le n \le p$,
and $f$ is a function whose domain includes the integer interval $[m..p]$, then

$$\sum_{k=m}^{p} f_k = \sum_{k=m}^{n} f_k + \sum_{k=n+1}^{p} f_k,$$

where, if $n = p$, the last summation should be interpreted as being 0.


Looking once again at

$$\sum_{j=-1}^{3} j^2 = (-1)^2 + 0^2 + 1^2 + 2^2 + 3^2,$$

we note that we can replace each term $j^2$ by $((j + 7) - 7)^2$ (which may seem like a silly thing
to do, but it leads us to an important property); this means that we have the easy equality

$$(-1)^2 + 0^2 + 1^2 + 2^2 + 3^2 = (6-7)^2 + (7-7)^2 + (8-7)^2 + (9-7)^2 + (10-7)^2,$$

which equals $\sum_{k=6}^{10} (k-7)^2$.

In general,


Proposition 2.1.3. (shifting indices) Suppose we have a function $f$, whose domain
includes the integer interval $[m..n]$. Let $p$ be an integer.

Then, by making the substitution $k = j + p$, so that $j = k - p$, we obtain

$$\sum_{j=m}^{n} f(j) = \sum_{k=m+p}^{n+p} f(k-p).$$

Thus, you get the same sum if you add $p$ to each of the bounds of the summation, and
simultaneously replace each occurrence of the index $k$ by $k$ minus $p$.
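The splitting and shifting rules are easy to check numerically. The following is a minimal Python sketch, not part of the original text; the helper names `sigma` and `shifted_sigma` are made up for illustration.

```python
def sigma(f, m, n):
    """Sum f(k) for k in the integer interval [m..n] (empty if n < m)."""
    return sum(f(k) for k in range(m, n + 1))


def shifted_sigma(f, m, n, p):
    """Same sum, re-indexed: both bounds raised by p, index replaced by k - p."""
    return sigma(lambda k: f(k - p), m + p, n + p)


def square(j):
    return j ** 2


# Splitting [-1..3] into [-1..0] and [1..3] leaves the total unchanged.
whole = sigma(square, -1, 3)
pieces = sigma(square, -1, 0) + sigma(square, 1, 3)

# Shifting the index by any integer p also leaves the total unchanged.
shifted = [shifted_sigma(square, -1, 3, p) for p in (-4, 0, 5)]

print(whole, pieces, shifted)
```

Treating a sum over an empty range as 0 matches the convention stated for the case where the last summation has no terms.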
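Consider now

$$\sum_{k=7}^{9} \left(1.7k^2 - \sqrt{3}\,\sin k\right) =$$
$$\left(1.7(7)^2 - \sqrt{3}\,\sin(7)\right) + \left(1.7(8)^2 - \sqrt{3}\,\sin(8)\right) + \left(1.7(9)^2 - \sqrt{3}\,\sin(9)\right) =$$
$$1.7\left(7^2 + 8^2 + 9^2\right) + \left(-\sqrt{3}\right)\left(\sin 7 + \sin 8 + \sin 9\right) =$$
$$1.7\sum_{k=7}^{9} k^2 + \left(-\sqrt{3}\right)\sum_{k=7}^{9} \sin k.$$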


More generally, the algebraic properties of real numbers (i.e., associativity, commutativity, 


and distributivity) immediately imply: 


Proposition 2.1.4. (linearity of summation) If $a$ and $b$ are constants, and $f$ and $g$ are
functions whose domain includes the integer interval $[m..n]$, then

$$\sum_{k=m}^{n} (af_k + bg_k) = a\sum_{k=m}^{n} f_k + b\sum_{k=m}^{n} g_k.$$
Example 2.1.5. The properties of summations described above may seem simple, but they
allow us to derive formulas involving sums that really don’t look so obvious. Consider, for
instance, the problem of “simplifying”

$$2\sum_{k=3}^{50} k(k-1) - \sum_{k=1}^{50} (k+1)(k+2). \tag{2.1}$$


We would like to combine the two summations, using linearity; however, the ranges of the 
indices of summation would need to be the same, and they are not. We will fix this “problem” 
in two different ways. Understand that the point of this example is not that you will necessarily 
agree that what we end up with is simpler than what we started with; the point is for you to 


understand the types of manipulations that we use. 


First approach: We split the second summation, and have

$$2\sum_{k=3}^{50} k(k-1) - \sum_{k=1}^{50} (k+1)(k+2) =$$
$$2\sum_{k=3}^{50} k(k-1) - \sum_{k=3}^{50} (k+1)(k+2) - \sum_{k=1}^{2} (k+1)(k+2).$$

Now we can use linearity on the first two of the three summations above to obtain

$$\sum_{k=3}^{50} \left[2k(k-1) - (k+1)(k+2)\right] - \sum_{k=1}^{2} (k+1)(k+2) =$$
$$\sum_{k=3}^{50} \left(2k^2 - 2k - k^2 - 3k - 2\right) - 6 - 12 =$$
$$-18 + \sum_{k=3}^{50} \left(k^2 - 5k - 2\right).$$


Second approach: We first shift the index in the second summation in Formula (2.1): we add 2
to each of the bounds and replace $k$ by $(k-2)$ in the summation. We obtain

$$2\sum_{k=3}^{50} k(k-1) - \sum_{k=1}^{50} (k+1)(k+2) =$$
$$2\sum_{k=3}^{50} k(k-1) - \sum_{k=3}^{52} \big((k-2)+1\big)\big((k-2)+2\big) =$$
$$2\sum_{k=3}^{50} k(k-1) - \sum_{k=3}^{52} (k-1)k.$$

The range of the index of summation on the right above still does not match that of the
summation on the left, but we can split off part of the sum

$$2\sum_{k=3}^{50} k(k-1) - \sum_{k=3}^{52} (k-1)k =$$
$$2\sum_{k=3}^{50} k(k-1) - \left[\sum_{k=3}^{50} (k-1)k\right] - (50)(51) - (51)(52) =$$
$$-(50)(51) - (51)(52) + \sum_{k=3}^{50} \left[2k(k-1) - k(k-1)\right] =$$
$$-(50)(51) - (51)(52) + \sum_{k=3}^{50} \left[k(k-1)\right],$$

which does not look very similar to our answer from the first approach, but is nonetheless equally
correct.
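Since the two answers look so different, a brute-force check can be reassuring. The sketch below is an illustration only (the function names are invented here); it evaluates the original expression and the end result of each approach directly.

```python
def original():
    # 2 * sum_{k=3}^{50} k(k-1)  -  sum_{k=1}^{50} (k+1)(k+2)
    first = 2 * sum(k * (k - 1) for k in range(3, 51))
    second = sum((k + 1) * (k + 2) for k in range(1, 51))
    return first - second


def first_approach():
    # -18 + sum_{k=3}^{50} (k^2 - 5k - 2)
    return -18 + sum(k * k - 5 * k - 2 for k in range(3, 51))


def second_approach():
    # -(50)(51) - (51)(52) + sum_{k=3}^{50} k(k-1)
    return -50 * 51 - 51 * 52 + sum(k * (k - 1) for k in range(3, 51))


print(original(), first_approach(), second_approach())
```

All three values agree, which is what "equally correct" means here: the two final expressions are different ways of writing the same number.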


We now want to define notation related to differences. 


Definition 2.1.6. Suppose that $m$ and $n$ are integers, and $m < n$, and suppose that we have
a real function $f$, whose domain is $[m..n]$. Then, we define the finite difference function
$\Delta f$ to be the function with domain $[(m+1)..n]$ given by

$$(\Delta f)(k) = f(k) - f(k-1).$$
Remark 2.1.7. We have been very formal with our notation above. Typically, we write things
like $\Delta k^2 = k^2 - (k-1)^2 = k^2 - (k^2 - 2k + 1) = 2k - 1$, in place of writing that, if $f(k) = k^2$,
then $(\Delta f)(k) = 2k - 1$.

We should also remark that, in the notation $[m..n]$, we mean to allow the cases where
$m = -\infty$ and/or $n = \infty$. To be precise, we mean that, if $m$ and $n$ are integers, then

$$[-\infty..n] = \{k \mid k \text{ is an integer, and } k \le n\},$$
$$[m..\infty] = \{k \mid k \text{ is an integer, and } m \le k\},$$

and $[-\infty..\infty]$ is the entire set of integers.

Finally, in Definition 2.1.6, if $m = -\infty$, then the $m+1$ which appears in the domain of $\Delta f$
should also be taken to equal $-\infty$.


Like summations, the finite difference operator is linear. 


Proposition 2.1.8. (linearity of differences) If $a$ and $b$ are constants, and $f$ and $g$ are
functions whose domain is $[m..n]$, then

$$\Delta\big(af(k) + bg(k)\big) = a\,\Delta f(k) + b\,\Delta g(k).$$


Despite the fact that it is trivial to prove, the following result turns out to be very useful. 


Proposition 2.1.9. (telescoping sums) Suppose that $m$ and $n$ are integers, and $m < n$.
If $f$ is a real function, whose domain is $[m..n]$, then

$$\sum_{k=m+1}^{n} \Delta f(k) = f(n) - f(m).$$
Proof. A rigorous proof would use mathematical induction. However, it is easy to see why this
is true.

$$\sum_{k=m+1}^{n} \Delta f(k) =$$
$$\big(f(m+1) - f(m)\big) + \big(f(m+2) - f(m+1)\big) + \big(f(m+3) - f(m+2)\big) + \cdots$$
$$+ \big(f(n-1) - f(n-2)\big) + \big(f(n) - f(n-1)\big).$$

Note that every term, except $f(n)$ and $f(m)$, occurs once with a plus sign and once with a
minus sign. Thus, all those intermediate terms “collapse” to 0. The result follows.
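The collapse can also be observed numerically. Here is a small Python sketch, offered as an illustration rather than as part of the proof; `delta`, `sum_of_differences`, and the choice of $f(k) = k^3$ are assumptions made for the example.

```python
def delta(f):
    """Return the finite difference function k -> f(k) - f(k - 1)."""
    return lambda k: f(k) - f(k - 1)


def sum_of_differences(f, m, n):
    """Sum (delta f)(k) for k = m+1, ..., n."""
    df = delta(f)
    return sum(df(k) for k in range(m + 1, n + 1))


def cube(k):
    return k ** 3


m, n = 2, 10
lhs = sum_of_differences(cube, m, n)  # adds up nine differences
rhs = cube(n) - cube(m)               # only the two end values survive
print(lhs, rhs)
```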


Proposition 2.1.10. Suppose that $b$ is a constant. We have the following formulas for
finite differences:

1. $\Delta k = 1$;

2. $\Delta k^2 = 2k - 1$;

3. $\Delta k^3 = 3k^2 - 3k + 1$; and

4. $\Delta b^{k+1} = b^k(b - 1)$.


Proof. These are all easy computations. For instance,

$$\Delta k^3 = k^3 - (k-1)^3 = k^3 - (k^3 - 3k^2 + 3k - 1) = 3k^2 - 3k + 1.$$

We leave the proofs of the remaining items as exercises.


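Corollary 2.1.11. Suppose that $b$ is a constant. We have the following formulas for finite
differences:

1. $k = \Delta\left[\dfrac{k(k+1)}{2}\right]$;

2. $k^2 = \Delta\left[\dfrac{k(k+1)(2k+1)}{6}\right]$; and

3. if $b \ne 1$, $b^k = \Delta\left[\dfrac{b^{k+1}}{b-1}\right]$.

Consequently,

a. if $n \ge 1$, $\displaystyle\sum_{k=1}^{n} k = \frac{n(n+1)}{2}$;

b. if $n \ge 1$, $\displaystyle\sum_{k=1}^{n} k^2 = \frac{n(n+1)(2n+1)}{6}$; and

c. if $n \ge 0$ and $b \ne 1$, then

$$\sum_{k=0}^{n} b^k = \frac{b^{n+1} - 1}{b - 1},$$

provided that $b^0$ is interpreted as equaling 1 when $b = 0$.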


Proof. Items (a), (b), and (c) follow immediately from Items (1), (2), and (3), respectively, by
applying Proposition 2.1.9. We shall prove Items (1), (2), and (3).

From Proposition 2.1.10, we have

$$\Delta k^2 = 2k - \Delta k,$$

and so, by the linearity of finite differences,

$$k = \frac{1}{2}\,\Delta\left(k^2 + k\right) = \Delta\left[\frac{k(k+1)}{2}\right].$$

From Proposition 2.1.10 and Item (1), we have

$$k^2 = \Delta\left[\frac{k^3}{3} + \frac{k(k+1)}{2} - \frac{k}{3}\right] = \Delta\left[\frac{2k^3 + 3k(k+1) - 2k}{6}\right] = \Delta\left[\frac{k(k+1)(2k+1)}{6}\right].$$

Finally, Proposition 2.1.10 and linearity immediately yield Item (3).


We should remark that the summation $\sum_{k=0}^{n} b^k$ that appears in Item (c) of Corollary 2.1.11
is one which will be very important to us in Chapter 4 and Chapter 5; it is called a geometric
sum.
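As a sanity check on the closed forms, the sketch below compares them with brute-force sums. It assumes the standard formulas $n(n+1)/2$, $n(n+1)(2n+1)/6$, and $(b^{n+1}-1)/(b-1)$; the function names are hypothetical, and exact rational arithmetic (`fractions.Fraction`) is used so that no rounding enters the comparison.

```python
from fractions import Fraction


def sum_k(n):
    """Closed form for 1 + 2 + ... + n."""
    return n * (n + 1) // 2


def sum_k_squared(n):
    """Closed form for 1^2 + 2^2 + ... + n^2."""
    return n * (n + 1) * (2 * n + 1) // 6


def geometric_sum(b, n):
    """Closed form for b^0 + b^1 + ... + b^n, valid when b != 1."""
    b = Fraction(b)
    return (b ** (n + 1) - 1) / (b - 1)


def brute_geometric(b, n):
    """Term-by-term geometric sum; Fraction(0) ** 0 evaluates to 1."""
    b = Fraction(b)
    return sum(b ** k for k in range(n + 1))


all_match = all(
    sum_k(n) == sum(range(1, n + 1))
    and sum_k_squared(n) == sum(k * k for k in range(1, n + 1))
    and geometric_sum(Fraction(1, 2), n) == brute_geometric(Fraction(1, 2), n)
    and geometric_sum(-3, n) == brute_geometric(-3, n)
    and geometric_sum(0, n) == brute_geometric(0, n)
    for n in range(1, 25)
)
print(all_match)
```

The case $b = 0$ works only because the term $0^0$ is taken to be 1, which is the interpretation the corollary asks for.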


2.1.1 Exercises 


Calculate the sums. 


T 
1. S$ (8k +2). 
k= 


5. $\sum_{k=1}^{5} (k-1)(k-2)(k-3)$.


10 


3 
1 
. > m 


m=—2 


Evaluate the sums by reindexing. 


11 


. 5 (39 - 18). e 
j=6 


8 


5 
12. S°(k—3)? — © (97 + 20). 


13. 


14. 


k= j=l 
9 0 
(>: (k? — 10k + 2») -| >> @? +87 +16) 
k=6 j=-3 
Prove the following statements of Proposition 2.1.10.

14. $\Delta k = 1$.

15. $\Delta k^2 = 2k - 1$.

16. $\Delta b^{k+1} = b^k(b - 1)$.


Calculate $\Delta f(k)$ for the following functions.

17. $f(k) = 5k$.

18. $f(k) = C$, where $C$ is a constant.

19. $f(k) = 3k^2 - 4k + 7$.

20. $f(k) = \ln k$.

21. $f(k) = \sin(2k\pi)$.

22.
fias5 
23. Prove that if $f(x) = \sin x$, then $(\Delta f)(x) = (\sin x)(1 - \cos 1) + (\sin 1)\cos x$.

24. Prove that if $g(x) = \cos x$, then $(\Delta g)(x) = (\cos x)(1 - \cos 1) - (\sin 1)\sin x$.

25. a. What is $\sum_{n=0}^{U} (-1)^n$ when $U$ is even?

    b. What is $\sum_{n=0}^{U} (-1)^n$ when $U$ is odd?


26. Is the following statement true? Assume $f$ and $g$ are differentiable functions.

$$\left(\sum_{j=1}^{3} f(j)\right)\left(\sum_{j=1}^{3} g(j)\right) = \sum_{j=1}^{3} f(j)g(j).$$

Prove or give a counterexample.

27. What is $\sum^{100} \dfrac{1}{k(k+1)}$? Hint: use partial fractions and find a telescoping sum.

28. What is $\sum_{k=-J}^{J} f(k)$ if $f$ is an odd function, $f(0) = y_0$, and $J$ is some positive integer?

29. What is $\sum_{j=17} \left(\sqrt{j} - \sqrt{j-1}\right)$?

30. Suppose that $f$ is a differentiable function. Prove that there exist numbers $c_i$ in each open
interval $(i-1, i)$, for $i = 1, 2, 3$, such that $f(3) - f(0) = f'(c_1) + f'(c_2) + f'(c_3)$.

31. Recall that the average of $n$ numbers is given by the formula:

$$\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n}.$$

Prove that $\sum_{i=1}^{n} (x_i - \bar{x}) = 0$.

The sample standard deviation of a data set, $s$, measures how spread out a data
set is. For example, if $A = \{84, 84, 84\}$ and $B = \{80, 84, 88\}$, then the sample standard
deviation of $B$ will be larger than the sample standard deviation of $A$ since the
points of $B$ are more dispersed than those in set $A$. The sample standard deviation
of a set with $n$ data points is

$$s = \sqrt{\frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n-1}}.$$

The units of $s$ are the same as those of the data.
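To make the comparison of the sets $A$ and $B$ concrete, here is a short Python sketch. It assumes the usual formula $s = \sqrt{\sum (x_i - \bar{x})^2/(n-1)}$, which is consistent with the sample variance defined in Exercise 35; the function name `sample_std` is invented for this illustration.

```python
import math


def sample_std(data):
    """Sample standard deviation: sqrt( sum (x_i - mean)^2 / (n - 1) )."""
    n = len(data)
    if n < 2:
        raise ValueError("need at least two data points")
    mean = sum(data) / n
    squared_deviations = sum((x - mean) ** 2 for x in data)
    return math.sqrt(squared_deviations / (n - 1))


s_A = sample_std([84, 84, 84])  # no spread at all
s_B = sample_std([80, 84, 88])  # same mean, more dispersed
print(s_A, s_B)
```

Both sets have mean 84, so the difference between the two values comes entirely from how far the points sit from that mean.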
32. A Calculus test is given to two sections of students. The grades of the students in section A
are {85, 65, 73, 40, 64, 90} and the grades of the students in section B are {84, 90, 96, 88, 100}.
What are $s_A$ and $s_B$?


33. Suppose that the week-to-week changes in a stock’s price, in dollars per share, over a six 
week period are: {+2.5, +4, —7,0,+3}. What is s for this set? The standard deviation of 
a stock’s price measures the volatility of the stock. 


34. The recorded annual rainfall, in inches, for a city over a five year span is {23, 47, 35, 42, 29}. 


What is the sample standard deviation? 


35. We define the sample variance to be the square of the sample standard deviation:

$$s^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n-1}.$$

Prove the following useful alternative formula:

$$s^2 = \frac{\sum \left(x_i^2\right) - (1/n)\left(\sum x_i\right)^2}{n-1}.$$

All sums are taken from 1 to $n$ where $n$ is the number of items in the data set.


Oftentimes statistics are used to measure the relationship between two variables.
For example, researchers could be interested in the relationship between drug
dosage and cancer cell counts. Basketball executives may be interested in the
relationship between a team’s free throw percentage and the team’s overall
winning percentage. The linear correlation coefficient, $r$, quantifies the relationship.
Specifically, $r$ helps answer the question: to what extent are the two variables
linearly related? $r$ is always between $-1$ and $+1$. If $r$ is close to $+1$ (resp. $-1$), then
a strong positive (resp. negative) linear relationship exists between the variables
in the sense that as one variable increases, the second variable tends to increase
(resp. decrease) proportionally. If $x_i$ and $y_i$ are two data sets with means $\bar{x}$ and $\bar{y}$
and standard deviations $s_x$ and $s_y$, then $r$ is given by:

$$r = \frac{\sum (x_i - \bar{x})(y_i - \bar{y})}{(n-1)\,s_x s_y}.$$


36. Suppose you work for a large retail store and your manager asks you to study the
relationship between revenue (sales) and the unemployment rate. Assume you assemble the
unemployment rates and sales over the last six months in the table below.


Unemp (%) | Sales ($) 
50350 
37570 
33140 
25550 
38750 
55450 


o1emewoon7 or 


Let x be the unemployment rate and y the sales. 


a. What is $\bar{x}$?
b. What is $\bar{y}$?
c. What is $s_x$?
d. What is $s_y$?
e. What is $r$?


37. Prove the following useful formula for the numerator of the formula for $r$:

$$\sum (x_i - \bar{x})(y_i - \bar{y}) = \sum (x_i y_i) - \frac{1}{n}\left(\sum x_i\right)\left(\sum y_i\right).$$


38. Another common question in statistics is, What is the line that best fits the data?
Assuming you believe that two variables have a linear relationship, you’d like to know
what line of the form $\hat{y} = mx + b$ appears to best describe the data. Here we use a ‘hat’
to emphasize that the output of the above formula is a predicted value, as opposed to an
actual value. In classical linear regression, formulas for the parameters $m$ and $b$ are:


a. Use these formulas to calculate the regression line of the above data. 


b. What sales does the line predict for an unemployment rate of 6%? 
Linear Regression FAQ


You probably have three good questions now: 


Q In what sense is this the line of best fit? 


A The line of best fit should be ‘close’ to the data. We quantify closeness by the square
of the difference between the predicted and actual data. More specifically, the quantity
$\sum (\hat{y}_i - y_i)^2$ is minimized.

Q Where do the formulas for $m$ and $b$ come from?

A We find $m$ and $b$ by minimizing the function:

$$f(m, b) = \sum (\hat{y}_i - y_i)^2 = \sum (mx_i + b - y_i)^2.$$


This is an elementary exercise in multivariable Calculus. In single variable Calculus, we 
find minima of a function by locating points where the derivative is zero. This procedure 


has a natural analog in higher dimensions. 
Q When is it appropriate to use linear regression? 


A There are several very important technical assumptions lurking behind linear regression 
that are seldom checked. One of these assumptions is that the data should be normally 
distributed about its mean at each y value. Many important economic models, including 
the Capital Asset Pricing Model and the Nobel Prize winning Black-Scholes option pricing 
method rely on the normality assumption. The extremely non-normal market fluctuations 
sparked by the burst of the housing market bubble in 2008-2009 have caused many people
to question the validity of the normality assumption.
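For readers who want to experiment with the minimization described in this FAQ, the following Python sketch may help. It is an illustration under stated assumptions: it uses the standard closed-form least-squares solution, $m = \sum (x_i - \bar{x})(y_i - \bar{y}) / \sum (x_i - \bar{x})^2$ and $b = \bar{y} - m\bar{x}$, which may be written differently from the formulas in the exercise, and the data are hypothetical rather than taken from the table above.

```python
def fit_line(xs, ys):
    """Return (m, b) minimizing sum (m*x_i + b - y_i)^2 (standard least squares)."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two equally long data sets with at least 2 points")
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are equal; the slope is undefined")
    m = sxy / sxx
    b = y_bar - m * x_bar
    return m, b


def squared_error(m, b, xs, ys):
    """The function f(m, b) being minimized."""
    return sum((m * x + b - y) ** 2 for x, y in zip(xs, ys))


# Hypothetical data lying exactly on the line y = 2x + 1.
xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]
m, b = fit_line(xs, ys)
print(m, b, squared_error(m, b, xs, ys))
```

Nudging `m` or `b` away from the fitted values and re-evaluating `squared_error` gives a quick feel for why this pair is the minimizer.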


Exercises 39 - 43 are based on the lyrics of the song The Twelve Days of
Christmas. You may find it helpful to have the lyrics in hand.

39. Let $f(m)$ be the total number of gifts received on the $m$th day. On the third day of
Christmas, for example, a total of six gifts are given: three French hens, two turtle doves,
and a partridge in a pear tree. Write a formula for $f(m)$ using summation notation.


40. What is f(12)? 
41. The cumulative number of gifts received after $m$ days, call this function $F(m)$, is also
growing. By the end of the third day, for example, 10 gifts have been received: 6 from
the 3rd day, 3 from the 2nd day and 1 from the 1st day. Express $F(m)$ using summation
notation.

42. What is $F(12)$?

43. Research the terms triangular numbers and tetrahedral numbers on the internet. How do
these terms relate to these exercises?

44.

45.

; oo ae 
By how much does In 2 differ from S- sa 


nr 


? Does the summation get closer to In 2 if 


n=1 
the upper limit is changed from 4 to 5? 


"(we +1\? fe ai\" 
Let A, = S- ( 7 ) and B, = ye ( 5 ) . Find an expression for A, + Bn. 
u=1 u=1 


Hint: combine the sums immediately rather than dealing with each one individually. 


The Fibonacci sequence is defined recursively. Set $F_0 = 0$, $F_1 = 1$ and, for $k \ge 2$,
$F_k = F_{k-1} + F_{k-2}$. This gives a sequence $\{0, 1, 1, 2, 3, 5, 8, \ldots\}$. Prove the following facts
about the Fibonacci sequence.


46. $\sum_{k=0}^{2n} F_k = F_{2n+2} - 1$.

47. $\sum_{k=1}^{n} F_{2k-1} = F_{2n}$.

48. $\sum_{k=0}^{n} F_{2k} = F_{2n+1} - 1$.


ee = as, 
k=0 


2.2. PRELUDE TO THE DEFINITE INTEGRAL 83


2.2 Prelude to the Definite Integral: Riemann Sums


In differential Calculus, the instantaneous rate of change of a function is defined as follows: we 
have a notion of the average rate of change of the function, and we believe, for small changes 
in the independent variable, that the average rate of change approximates something. We 
then take limits of the average rates of change to define the instantaneous rate of change, the 


derivative. 


In this section, we do something analogous: we will define Riemann sums, which are supposed 
to be approximations to “continuous sums of an infinite number of infinitesimal contributions”.
As with the passage from average rates of change to instantaneous ones, we pass from Riemann 
sums to continuous sums by taking limits. The continuous sum that we end up with is called 


the definite integral. 


Let’s begin with an extended example. 


Example 2.2.1. Suppose that a car is traveling along a straight road, on which a coordinate 
axis has been laid out in meters. Let p(t) denote the position of the car (on the axis), at time t 
seconds (after some initial starting time), and let v(t) denote the velocity of the car, in meters 
per second, at time t seconds. 


If the driver of the car were to brake hard, or step down hard on the accelerator, he or she 
could easily change the velocity of the car by a noticeable amount in 2 seconds. However, it 
would be difficult to change the velocity of the car in a significant way in a substantially smaller 
time interval, like 0.25 seconds. 


We want to discuss how you would estimate the change in the position, the displacement, of
the car, between times 0 and 2 seconds, if you were given some velocity measurements. First,
we should clarify that, when we have two numbers $a$ and $b$, where $a \le b$, and we write that $x$
is between $a$ and $b$, we mean to allow for the possibility that $x$ equals $a$ or $b$, i.e., “$x$ is between
$a$ and $b$” means $a \le x \le b$. If $a < b$, and we write that $x$ is strictly between $a$ and $b$, we mean
to disallow the possibility that $x$ equals $a$ or $b$, i.e., “$x$ is strictly between $a$ and $b$” means
$a < x < b$.


Now, suppose that we know that,

• at some time between 0 and 0.3 seconds, the car is moving with velocity 30 m/s;

• at some time between 0.3 and 0.8 seconds, the car is moving with velocity 20 m/s;

• at some time between 0.8 and 1.2 seconds, the car is moving with velocity 10 m/s; and,

• at some time between 1.2 and 2 seconds, the car is moving at $-2$ m/s (i.e., at 2 m/s in the
negative direction).


How can we estimate the displacement of the car between times t = 0 and t = 2 seconds, i.e., 
how can we estimate p(2) — p(0)? 


Note that we have the equality

$$p(2) - p(0) = \big(p(2) - p(1.2)\big) + \big(p(1.2) - p(0.8)\big) + \big(p(0.8) - p(0.3)\big) + \big(p(0.3) - p(0)\big), \tag{2.2}$$

which results from the fact that all of the terms, other than $p(2)$ and $p(0)$, cancel out, that is,
the sum telescopes, as in Proposition 2.1.9. Thus, to estimate $p(2) - p(0)$, we can estimate the
four sub-displacements appearing in Formula (2.2) and add them together.


Okay. Fine. So how do we approximate each of the sub-displacements? 


The answer is that, for each of the four intervals of time [0,0.3], [0.3,0.8], [0.8,1.2], and 
[1.2,2] (subintervals of [0,2]), we use, as an approximation, that the velocity is constant on 
the given interval. Why do we do this? For two reasons. First, the subintervals of time are 
fairly small, small enough so that we believe that the velocity cannot change too much on each 
subinterval. Second, we don’t really have much of a choice, considering that the only data that 


we are given is the velocity at some time in each of the given subintervals. 


Thus, as an approximation, we assume/pretend that, on the subintervals [0, 0.3], [0.3, 0.8],
[0.8, 1.2], and [1.2, 2], the velocity is constantly 30, 20, 10, and $-2$ m/s, respectively. Now, if the
velocity is a constant during an interval of time, then that velocity is also the average velocity 
during the interval of time, and we know that the average velocity on an interval of time is the 
change in position on the interval divided by the change in time on the interval. Thus, we have 
the four approximations (in m/s): 


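$$\frac{p(2) - p(1.2)}{2 - 1.2} \approx -2, \qquad \frac{p(1.2) - p(0.8)}{1.2 - 0.8} \approx 10, \qquad \frac{p(0.8) - p(0.3)}{0.8 - 0.3} \approx 20, \qquad \text{and} \qquad \frac{p(0.3) - p(0)}{0.3 - 0} \approx 30.$$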


Therefore, we approximate the displacement of the car, between times $t = 0$ and $t = 2$
seconds:

$$p(2) - p(0) = \big(p(2) - p(1.2)\big) + \big(p(1.2) - p(0.8)\big) + \big(p(0.8) - p(0.3)\big) + \big(p(0.3) - p(0)\big)$$
$$\approx -2(2 - 1.2) + 10(1.2 - 0.8) + 20(0.8 - 0.3) + 30(0.3 - 0)$$
$$= -1.6 + 4 + 10 + 9 = 21.4 \text{ meters}.$$


Therefore, we conclude that the displacement of the car, between times t = 0 and t = 2 seconds, 
is approximately 21.4 meters. Note that this is an estimate of the change in the position; we 
cannot estimate (in a reasonable manner) the actual position of the car at t = 2 seconds, unless 


we have p(0) (or, at least, an estimate of p(0)). 


Our data and approximations of the displacements on the subintervals look best in a table. 


subinterval (sec.)       | [0, 0.3] | [0.3, 0.8] | [0.8, 1.2] | [1.2, 2]
time (sec.)              | $t_1$    | $t_2$      | $t_3$      | $t_4$
velocity (m/s)           | 30       | 20         | 10         | -2
Approx. $\Delta p$ (meters) | 9     | 10         | 4          | -1.6

Approx. total $\Delta p$ = 9 + 10 + 4 + (-1.6) = 21.4 meters.


Note that we gave the names $t_1$, $t_2$, $t_3$, and $t_4$ to the times in the corresponding subintervals
at which we were given the velocities that appear in the 3rd row. Our approximation of the
total displacement is the sum of the entries in the 4th row of the table.


How could we obtain a better approximation of the displacement p(2) — p(0) or, at least, 
an approximation that we’d expect to be better? We could subdivide the subintervals above 
into even smaller subintervals of time, and be given velocities at some time in each of the new 


subintervals. 


For instance, if we thought that 0.8 seconds was too large of a change in time over which
to approximate the velocity as being constant, we might subdivide the last subinterval [1.2, 2]
above into two subintervals, say [1.2, 1.6] and [1.6, 2]. We would then need to know which of
these new subintervals contains the time $t_4$ (both would, if $t_4 = 1.6$), and we’d need to be given
the velocity at some time in the other new subinterval. For this example, let’s assume that $t_4$ is
in the subinterval [1.2, 1.6], and that we know, at some time $t_5$ in the interval [1.6, 2], that the
velocity is $-5$ m/s. What approximation do we get for the total displacement now?


Our new table is: 


subinterval (sec.)       | [0, 0.3] | [0.3, 0.8] | [0.8, 1.2] | [1.2, 1.6] | [1.6, 2]
time (sec.)              | $t_1$    | $t_2$      | $t_3$      | $t_4$      | $t_5$
velocity (m/s)           | 30       | 20         | 10         | -2         | -5
Approx. $\Delta p$ (meters) | 9     | 10         | 4          | -0.8       | -2

Approx. total $\Delta p$ = 9 + 10 + 4 + (-0.8) + (-2) = 20.2 meters.
Our new approximation for the displacement $p(2) - p(0)$ is no longer 21.4 meters, but is now
20.2 meters. 
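The arithmetic in the two tables can be reproduced in a few lines. The following Python sketch is illustrative only; the function name `estimate_displacement` is an assumption, and the inputs are the subinterval endpoints and velocities from the tables.

```python
def estimate_displacement(endpoints, velocities):
    """Add up velocity * (subinterval length), one velocity per subinterval."""
    if len(velocities) != len(endpoints) - 1:
        raise ValueError("need exactly one velocity per subinterval")
    lengths = [right - left for left, right in zip(endpoints, endpoints[1:])]
    return sum(v * dt for v, dt in zip(velocities, lengths))


# First table: four subintervals.
first = estimate_displacement([0, 0.3, 0.8, 1.2, 2], [30, 20, 10, -2])

# Second table: the last subinterval is split at 1.6, with one new velocity.
second = estimate_displacement([0, 0.3, 0.8, 1.2, 1.6, 2], [30, 20, 10, -2, -5])

# Rounded for display, because 0.3, 0.8, ... are not exact in binary floating point.
print(round(first, 6), round(second, 6))
```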


Do we really know that this new approximation is better than the first one? Not really. For
all we know, in the last subinterval [1.6, 2], the velocity is actually closer to $-2$ m/s most
of the time, and is only close to $-5$ m/s for a tiny subinterval around $t_5$. We can try to appeal
to our intuition and/or physical experience to say that, in 0.4 seconds, the velocity cannot
change a significant amount, but that’s no proof, and it also leaves us with the question of:
what’s a “significant” amount? Nonetheless, we “suspect” that subdividing our subintervals
into smaller subintervals leads to better approximations, provided we take our subdivisions
small enough.


Despite the remark/warning above, it is nonetheless true that there is a strong sense in which 


using small enough subintervals guarantees a close approximation; see the remark below. 


Remark 2.2.2. It is possible to make precise in what way the approximation “gets better”
as the time intervals get smaller. First, we need to assume that the velocity function $v(t)$
is continuous on the interval [0, 2]. Now, call a subdivision of the interval [0, 2] into a finite
collection of subintervals a partition of [0, 2] (technically, the partition is just the set of endpoints
of the subintervals; see Definition 2.2.3). Call the length of the longest (any one of the longest)
subintervals the mesh of the partition. Thus, if the mesh of the partition is less than some (small)
positive constant $\delta$, then every subinterval in the partition has length less than $\delta$; informally,
saying that the mesh of a partition is small means that every subinterval in the partition is small.


Now, we can state carefully in what sense picking small subintervals of time, and being given
the velocity at one time per time interval, allows you to accurately approximate the actual
displacement. Given any $\epsilon > 0$ (think: $\epsilon$ could be an arbitrarily small positive number), we can
guarantee that our approximation for the displacement is within $\epsilon$ of the actual displacement
by using partitions with a small enough mesh, i.e., there exists a number $\delta > 0$ such that, for
every partition of [0, 2], with mesh less than $\delta$, for every choice of one time (a sample point) per
subinterval at which to be given the velocity, the sum of the products of the lengths of each
subinterval with the given velocities for the subintervals will be within $\epsilon$ of the displacement
$p(2) - p(0)$.


The above statement is a result of the definition of the definite integral, Definition 
Theorem and the Fundamental Theorem of Calculus, Theorem 2.4.10.




We wish to discuss partitions, meshes, sample points, and the summing process above in a 


more general context. 


Suppose that a < b. It turns out to be convenient to define a partition of the interval [a, b]
by giving the endpoints of the subintervals that we “chop” [a, b] into, rather than defining the 
partition to be the collection of subintervals themselves. 


Definition 2.2.3. A partition $P$ of the interval $[a, b]$, into $n$ subintervals, is an ordered
set of numbers $x_0, x_1, \ldots, x_n$ such that $x_0 = a$, $x_n = b$, and $x_0 < x_1 < \cdots < x_n$.

If $1 \le i \le n$, then the closed interval $[x_{i-1}, x_i]$ is the $i$-th subinterval of the partition $P$;
note that the finite difference $\Delta x_i$ equals $x_i - x_{i-1}$, which is the length of the $i$-th subinterval
of the partition.

The mesh of a partition $P$, denoted $\|P\|$, is the maximum length of a subinterval of the
partition, i.e., $\|P\| = \max\{\Delta x_i \mid 1 \le i \le n\}$.

A set $S$ of sample points for a partition $P$ is an ordered set of points, one for each
subinterval of the partition, i.e., an ordered set of elements $s_1, s_2, \ldots, s_n$ such that, for all
$i$, where $1 \le i \le n$, $s_i$ is in the $i$-th subinterval of the partition, that is, $x_{i-1} \le s_i \le x_i$.

A sampled partition of the interval $[a, b]$ is an ordered pair $(P, S)$, consisting of a partition
$P$ of the interval $[a, b]$ and a set $S$ of sample points for $P$.


For instance, in Example 2.2.1, we had the partition P = {0, 0.3, 0.8, 1.2, 2}. The 1st
subinterval is [0, 0.3], the 2nd is [0.3, 0.8], the 3rd subinterval is [0.8, 1.2], and the 4th is [1.2, 2]. The
mesh is ||P|| = 2 - 1.2 = 0.8. We were not given explicit sample points; we were simply told
that, in each subinterval, there was a sample point (a time) at which we knew the velocity, and 


we were given those velocities. 
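The notions of partition, mesh, and sample set translate directly into code. The sketch below is one possible rendering, with invented function names; since the example gives no explicit sample times, the sample points used here are hypothetical.

```python
def is_partition(points, a, b):
    """True if points = [x_0, ..., x_n] with x_0 = a, x_n = b, strictly increasing."""
    return (
        len(points) >= 2
        and points[0] == a
        and points[-1] == b
        and all(left < right for left, right in zip(points, points[1:]))
    )


def mesh(points):
    """Maximum subinterval length, i.e. max of x_i - x_{i-1}."""
    return max(right - left for left, right in zip(points, points[1:]))


def is_sample_set(points, samples):
    """True if there is one sample per subinterval with x_{i-1} <= s_i <= x_i."""
    return len(samples) == len(points) - 1 and all(
        left <= s <= right
        for left, s, right in zip(points, samples, points[1:])
    )


P = [0, 0.3, 0.8, 1.2, 2]
hypothetical_samples = [0.1, 0.5, 1.0, 1.9]  # made up; not given in the example

print(is_partition(P, 0, 2), mesh(P), is_sample_set(P, hypothetical_samples))
```

Note that an endpoint shared by two subintervals, such as 0.3, is allowed as the sample point for either of them.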


Now, we need to define the general setup for the type of summation that we used in
Example 2.2.1. We continue to assume that a < b.
Definition 2.2.4. Let $P = \{x_0, \ldots, x_n\}$ be a partition of $[a, b]$, into $n$ subintervals, let
$S = \{s_1, \ldots, s_n\}$ be a set of sample points for $P$, and let $f$ be a real-valued function whose
domain includes the set of sample points $S$.

Then, the Riemann sum, $R_P^S(f)$, of $f$, with respect to $P$ and $S$, is defined to be

$$\sum_{i=1}^{n} f(s_i)\,\Delta x_i =$$
$$f(s_1)(x_1 - x_0) + f(s_2)(x_2 - x_1) + \cdots + f(s_{n-1})(x_{n-1} - x_{n-2}) + f(s_n)(x_n - x_{n-1}).$$


For instance, in Example 2.2.1, for each of our sampled partitions, the associated Riemann


sums of the velocity are precisely our approximations of the displacement p(2) — p(0). 
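When the function $f$ is given by a formula, the definition of a Riemann sum can be coded almost word for word. The following Python sketch is an illustration, not a prescribed implementation; the function name `riemann_sum` and the test function $f(x) = 2x$ on $[0, 4]$ are choices made for this example.

```python
def riemann_sum(f, partition, samples):
    """Sum of f(s_i) * (x_i - x_{i-1}) over the subintervals of the partition."""
    if len(samples) != len(partition) - 1:
        raise ValueError("need exactly one sample point per subinterval")
    total = 0.0
    for i in range(1, len(partition)):
        left, right = partition[i - 1], partition[i]
        s = samples[i - 1]
        if not left <= s <= right:
            raise ValueError("sample point lies outside its subinterval")
        total += f(s) * (right - left)
    return total


def f(x):
    return 2 * x


partition = [0, 1, 2, 3, 4]
midpoints = [0.5, 1.5, 2.5, 3.5]
left_endpoints = [0, 1, 2, 3]

print(riemann_sum(f, partition, midpoints), riemann_sum(f, partition, left_endpoints))
```

Different sample sets for the same partition generally give different Riemann sums, which is why a sampled partition records both the partition and the sample points.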


We need some terminology for when one partition has “smaller” subintervals than another,
and for when we enlarge our set of sample points. It will be helpful to recall the notion of a
subset: a set $A$ is a subset of a set $B$, denoted $A \subseteq B$, if and only if every element of $A$ is also
an element of $B$.


Definition 2.2.5. A partition $Q$ of an interval $[a, b]$ is a refinement of a partition $P$ of
$[a, b]$ if and only if every point in $P$ is in $Q$, i.e., if and only if $P \subseteq Q$.

If $(P, S)$ is a sampled partition of $[a, b]$, then another sampled partition $(Q, T)$ of $[a, b]$ is a
refinement of $(P, S)$ if and only if $P \subseteq Q$ and $S \subseteq T$.


Remark 2.2.6. As an example, our second sampled partition in Example 2.2.1, the one given
in the second table, is a refinement of the original sampled partition in that example.

Understand the point of a refinement: the subintervals of a refinement are obtained from the
subintervals of the original partition by subdividing some of the original subintervals into more,
smaller, subintervals. Thus, if $P$ and $Q$ are partitions of an interval, and $P \subseteq Q$, then we have
an inequality of meshes $\|Q\| \le \|P\|$.


For a refinement of a sampled partition, we not only subdivide some of the original subin- 
tervals, but we keep the original sample points and throw in some new ones, to give us sample 
points in each new subinterval that does not contain an old sample point (or, if an old sample 


point is the endpoint of a subinterval in the refined partition, then we assign that point to one 
of the two adjacent subintervals, and must select a new sample point for the other adjacent
subinterval). 


Our interest in Riemann sums is not limited to velocity and displacement. 


Example 2.2.7. Suppose that a balloon is being inflated between times t = 0 and t = 120
seconds, and the inflation rates, measured in in³/s, are 8, 7, 5, and 2 at times 50, 70, 90, and
110 seconds, respectively.


Estimate the change in the volume V = V(t) of air in the balloon between times t = 40 and 
t = 120 seconds. 


Solution: 


We will begin by selecting a partition of [40, 120] that allows us to use 50, 70, 90, and 110
as sample points. An obvious choice is to subdivide [40, 120] into 4 subintervals of equal length
Δt = (120 - 40)/4 = 20. Then, the subintervals of the partition would be [40, 60], [60, 80],
[80, 100], and [100, 120], and the points 50, 70, 90, and 110 would be sample points for
this partition (in fact, they are the midpoints of the subintervals).


subinterval (sec.)      | [40, 60] | [60, 80] | [80, 100] | [100, 120]
time (sec.)             | 50       | 70       | 90        | 110
inflation rate (in³/s)  | 8        | 7        | 5         | 2
Approx. $\Delta V$ (in³)      | 160      | 140      | 100       | 40


Approx. total $\Delta V$ = 160 + 140 + 100 + 40 = 440 in³.


We should emphasize that 440 in³ is the approximate change in the volume between times
t = 40 and t = 120 seconds. With no knowledge of the volume of air in the balloon at time t = 40
seconds, we cannot reasonably approximate the actual volume V(120) of air in the balloon at
time t = 120 seconds.
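The arithmetic in the table above amounts to summing (rate) times (subinterval length). A minimal Python sketch of that computation follows; the function name `riemann_sum` and the variable names are illustrative assumptions.

```python
def riemann_sum(values, widths):
    """Sum of (sampled function value) * (subinterval length)."""
    return sum(v * w for v, w in zip(values, widths))


# Data from Example 2.2.7: inflation rates (in^3/s) sampled at t = 50, 70, 90, 110.
rates = [8, 7, 5, 2]
partition = [40, 60, 80, 100, 120]
widths = [b - a for a, b in zip(partition, partition[1:])]

approx_change = riemann_sum(rates, widths)
print(approx_change)  # 440
```

The result is only the approximate change in volume on [40, 120]; as noted above, it says nothing about V(120) itself.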


Let’s look at another example. 


Example 2.2.8. Suppose that a circular rod, of length 1 meter, and cross-sectional area 0.01
m² (i.e., of radius $0.1/\sqrt{\pi}$ meters) is lying along the x-axis.


Figure 2.1: A rod of varying density.


Suppose that, for all x such that $0 \le x \le 1$, at each point in the cross section of the rod at
x meters, the density of the rod is $\delta(x) = (1 + x)$ kg/m³. We would like to estimate the total
mass of the rod, by partitioning the interval [0,1] into 5 intervals of equal length.


Thus, each subinterval of our partition will have length $\Delta x = (1 - 0)/5 = 0.2$ meters, and so
our partition is P = {0, 0.2, 0.4, 0.6, 0.8, 1}. While we shall not use it, we remark that the mesh
of P is $\|P\| = 0.2$.


Let’s first use the left endpoint of each subinterval of our partition as the sample point for 
that subinterval. Thus, our left sample set is L = {0, 0.2, 0.4, 0.6, 0.8}. 


This will yield the left Riemann sum, once we determine what function we’re finding Riemann
sums of. Average density is mass per volume. In a way analogous to how we dealt with velocity,
position, and time in our earlier example, we will assume that, for a small x-interval, the average
density is approximated fairly well by the (instantaneous) density at any x-coordinate in the
given interval. Therefore, if we take the density $\delta(x)$ at a point, and multiply it by a little
chunk of volume around that point, we will obtain an approximation of the mass of the chunk.
A nice, manageable chunk of volume “surrounding” a given x-coordinate is given simply by
taking the length of an x-interval containing the given x-coordinate, and multiplying by the
cross-sectional area, 0.01 m². Thus, if we’re going to multiply by lengths of subintervals in our
Riemann sums, then the function that we want to find Riemann sums of, in order to approximate
the mass, is cross-sectional area times density, i.e.,

\[ f(x) = 0.01(1 + x) \text{ kg/m}. \]


For $0 \le x \le 1$ meter, let M(x) denote the mass of the rod between 0 and x, so that, if
$0 \le a < b \le 1$, then M(b) - M(a) is the mass of the rod between x = a and x = b. We obtain a
table of data that looks like:

$[x_{i-1}, x_i]$ (m) | [0, 0.2]  | [0.2, 0.4]     | [0.4, 0.6]     | [0.6, 0.8]     | [0.8, 1]
$s_i$ (m)            | 0         | 0.2            | 0.4            | 0.6            | 0.8
$f(s_i)$ (kg/m)      | 0.01      | 0.01(1.2)      | 0.01(1.4)      | 0.01(1.6)      | 0.01(1.8)
$\Delta M$ (kg)      | 0.01(0.2) | 0.01(1.2)(0.2) | 0.01(1.4)(0.2) | 0.01(1.6)(0.2) | 0.01(1.8)(0.2)


Approx. total M = $R_P^L(f)$ = (0.01)(0.2)(1 + 1.2 + 1.4 + 1.6 + 1.8) = 0.014 kg.


Now let’s look at the Riemann sum, where we use the same partition, but we use right
endpoints of the subintervals as sample points. Thus, we let R = {0.2, 0.4, 0.6, 0.8, 1}, and will
calculate $R_P^R(f)$, the right Riemann sum of the same function f(x) that we used above.


$[x_{i-1}, x_i]$ (m) | [0, 0.2]       | [0.2, 0.4]     | [0.4, 0.6]     | [0.6, 0.8]     | [0.8, 1]
$s_i$ (m)            | 0.2            | 0.4            | 0.6            | 0.8            | 1
$f(s_i)$ (kg/m)      | 0.01(1.2)      | 0.01(1.4)      | 0.01(1.6)      | 0.01(1.8)      | 0.01(2)
$\Delta M$ (kg)      | 0.01(1.2)(0.2) | 0.01(1.4)(0.2) | 0.01(1.6)(0.2) | 0.01(1.8)(0.2) | 0.01(2)(0.2)


Approx. total M = $R_P^R(f)$ = (0.01)(0.2)(1.2 + 1.4 + 1.6 + 1.8 + 2) = 0.016 kg.


There is a third common set of sample points that frequently gets used: the midpoints of
the intervals; we would normally denote this set by M, but, in our current example, M is
already being used to denote the mass. So, we let C = {0.1, 0.3, 0.5, 0.7, 0.9}, and call $R_P^C(f)$
the midpoint Riemann sum. We calculate:


$[x_{i-1}, x_i]$ (m) | [0, 0.2]       | [0.2, 0.4]     | [0.4, 0.6]     | [0.6, 0.8]     | [0.8, 1]
$s_i$ (m)            | 0.1            | 0.3            | 0.5            | 0.7            | 0.9
$f(s_i)$ (kg/m)      | 0.01(1.1)      | 0.01(1.3)      | 0.01(1.5)      | 0.01(1.7)      | 0.01(1.9)
$\Delta M$ (kg)      | 0.01(1.1)(0.2) | 0.01(1.3)(0.2) | 0.01(1.5)(0.2) | 0.01(1.7)(0.2) | 0.01(1.9)(0.2)


Approx. total M = $R_P^C(f)$ = (0.01)(0.2)(1.1 + 1.3 + 1.5 + 1.7 + 1.9) = 0.015 kg.
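The three tables can be reproduced with a few lines of Python. This is a minimal sketch using exact rational arithmetic so that the totals come out without rounding noise; the function and variable names are illustrative assumptions.

```python
from fractions import Fraction


def f(x):
    """Cross-sectional area times density, in kg/m, as in Example 2.2.8."""
    return Fraction(1, 100) * (1 + x)


def riemann_sum(func, partition, samples):
    """Sum of func(s_i) * (x_i - x_{i-1}) over the subintervals of the partition."""
    return sum(func(s) * (b - a)
               for s, a, b in zip(samples, partition, partition[1:]))


P = [Fraction(k, 5) for k in range(6)]        # {0, 0.2, 0.4, 0.6, 0.8, 1}
left = P[:-1]                                  # left endpoints
right = P[1:]                                  # right endpoints
mid = [(a + b) / 2 for a, b in zip(P, P[1:])]  # midpoints

left_sum = riemann_sum(f, P, left)
right_sum = riemann_sum(f, P, right)
mid_sum = riemann_sum(f, P, mid)
print(float(left_sum), float(right_sum), float(mid_sum))
```

For this linear f, the midpoint total is exactly the average of the left and right totals.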


The midpoint Riemann sum here lies between the left and right Riemann sums, and is, in fact,
their average; neither property holds in general. They hold in this example because our function
f is linear, which also causes the midpoint Riemann sum to be exactly the value of what we will
later call the definite integral of f over [0,1]. In the completely general setting, the midpoint
Riemann sums can yield worse approximations of the definite integral than the left and right
Riemann sums. However, generally, we expect midpoint Riemann sums to yield better
approximations than the left and right Riemann sums.


What we would like to see is that, if we use partitions of [0, 1] with arbitrarily small (positive)
meshes, then the associated Riemann sums of f get arbitrarily close to some value; that value
must then be the total mass of the rod.


Suppose that n is a natural number. Divide the interval [0,1] into n subintervals of equal
length, so that the associated partition is

\[ P_n = \{0, 1/n, 2/n, \ldots, (n-1)/n, 1\}. \]

We will calculate the left, right and midpoint Riemann sums of f with respect to $P_n$, and see
that all three of them approach the same limit as $n \to \infty$. (Here, we are using the limit of a
sequence, which means that n takes on only integer values. We discussed this in [2]; also see
the definition of the limit of a sequence.)


The left sample set is $L_n = \{0, 1/n, 2/n, \ldots, (n-1)/n\}$, where $s_i = (i-1)/n$. The right
sample set is $R_n = \{1/n, 2/n, \ldots, 1\}$, where $s_i = i/n$. The midpoint sample set is
$C_n = \{1/(2n), 3/(2n), \ldots, (2n-1)/(2n)\}$, where $s_i = (i - 1/2)/n = (2i-1)/(2n)$.

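We have:

\[ R_{P_n}^{L_n}(f) = \sum_{i=1}^{n} 0.01\left(1 + \frac{i-1}{n}\right)\frac{1}{n}
 = \frac{0.01}{n}\left[\sum_{i=1}^{n} 1 + \frac{1}{n}\sum_{i=1}^{n} i - \frac{1}{n}\sum_{i=1}^{n} 1\right]. \]

As $\sum_{i=1}^{n} 1 = n$ and, by the corollary giving the sum of the first n positive integers,
$\sum_{i=1}^{n} i = \frac{n(n+1)}{2}$, we find that

\[ R_{P_n}^{L_n}(f) = \frac{0.01}{n}\left[n + \frac{1}{n}\cdot\frac{n(n+1)}{2} - \frac{1}{n}\cdot n\right]
 = \frac{0.01}{n}\left[\frac{2n + (n+1) - 2}{2}\right]
 = 0.01\left[\frac{3}{2} - \frac{1}{2n}\right]. \]

Thus, as $n \to \infty$, we see that the left Riemann sums approach 0.01(3/2) = 0.015 kg.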


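Similarly, we find

\[ R_{P_n}^{R_n}(f) = \sum_{i=1}^{n} 0.01\left(1 + \frac{i}{n}\right)\frac{1}{n}
 = \frac{0.01}{n}\left[n + \frac{1}{n}\cdot\frac{n(n+1)}{2}\right]
 = 0.01\left[\frac{3}{2} + \frac{1}{2n}\right]. \]

Thus, as $n \to \infty$, we see that the right Riemann sums also approach 0.01(3/2) = 0.015 kg.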


Finally, for the midpoint Riemann sums, we have

\[ R_{P_n}^{C_n}(f) = \sum_{i=1}^{n} 0.01\left(1 + \frac{2i-1}{2n}\right)\frac{1}{n}
 = \frac{0.01}{n}\left[n + \frac{1}{2n}\left(2\cdot\frac{n(n+1)}{2} - n\right)\right]
 = \frac{0.01}{n}\left[n + \frac{n}{2}\right] = 0.015. \]

Thus, for each n, $R_{P_n}^{C_n}(f) = 0.015$ kg, and so, certainly, as $n \to \infty$, we see that the limit of the
midpoint Riemann sums is also 0.015 kg.
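The closed forms for the uniform partitions can be checked numerically. The sketch below, with illustrative function names, evaluates the three sums directly from their definitions using exact fractions and prints them for a few values of n.

```python
from fractions import Fraction

SCALE = Fraction(1, 100)  # the factor 0.01 in f(x) = 0.01(1 + x)


def left_sum(n):
    """Left Riemann sum of f over n equal subintervals of [0, 1]."""
    return sum(SCALE * (1 + Fraction(i - 1, n)) * Fraction(1, n)
               for i in range(1, n + 1))


def right_sum(n):
    """Right Riemann sum of f over n equal subintervals of [0, 1]."""
    return sum(SCALE * (1 + Fraction(i, n)) * Fraction(1, n)
               for i in range(1, n + 1))


def mid_sum(n):
    """Midpoint Riemann sum of f over n equal subintervals of [0, 1]."""
    return sum(SCALE * (1 + Fraction(2 * i - 1, 2 * n)) * Fraction(1, n)
               for i in range(1, n + 1))


for n in (5, 50, 500):
    print(n, float(left_sum(n)), float(mid_sum(n)), float(right_sum(n)))
```

The left and right columns move toward 0.015 from below and above, while the midpoint column equals 0.015 for every n.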


In differential Calculus, it is extremely helpful to picture instantaneous rates of change
graphically. This is accomplished by noting that the slope of a secant line yields the average rate
of change, and then we take limits to arrive at the notion of a tangent line; we then visualize
the instantaneous rate of change as the slope of the appropriate tangent line.


Our question now is: can we do something similar for Riemann sums and their
limits, in order to picture these things geometrically?


The answer is (as you may have suspected): YES. We discuss this in the example below.


Example 2.2.9. How can we graphically represent Riemann sums? As a specific example, how
can we graphically represent the left, right, and midpoint Riemann sums from Example 2.2.8?


Recall that the function we were finding Riemann sums of was

\[ y = f(x) = 0.01(1 + x). \]

(We shall omit the units throughout this example.) The graph is, of course, a straight line.


Figure 2.2: The graph of y = 0.01(1 + x).


Let’s first look at the partition of [0,1] given by P = {0, 0.2, 0.4, 0.6, 0.8, 1}, and the left
sample set L = {0, 0.2, 0.4, 0.6, 0.8}. How can we visualize $R_P^L(f)$? We do it in terms of the
areas of rectangles.


The Riemann sum is

\[ R_P^L(f) = f(s_1)\Delta x_1 + f(s_2)\Delta x_2 + f(s_3)\Delta x_3 + f(s_4)\Delta x_4 + f(s_5)\Delta x_5 \]
\[ = f(0)\cdot 0.2 + f(0.2)\cdot 0.2 + f(0.4)\cdot 0.2 + f(0.6)\cdot 0.2 + f(0.8)\cdot 0.2. \]


As f(x) is non-negative on the interval [0, 1], we can interpret $f(s_i)\Delta x_i$ as the area of a rectangle
of height $f(s_i)$ and width $\Delta x_i$; we draw this rectangle over the i-th subinterval on the x-axis.
Thus, we represent $f(0)\Delta x_1 = f(0)\cdot 0.2$ by the rectangle in Figure 2.3, and the entire left
Riemann sum is equal to the total area of all five of the inscribed rectangles in Figure 2.4.


Figure 2.3: The first left summand. Figure 2.4: The left Riemann sum.


What about the right Riemann sum? We have

\[ R_P^R(f) = f(s_1)\Delta x_1 + f(s_2)\Delta x_2 + f(s_3)\Delta x_3 + f(s_4)\Delta x_4 + f(s_5)\Delta x_5 \]
\[ = f(0.2)\cdot 0.2 + f(0.4)\cdot 0.2 + f(0.6)\cdot 0.2 + f(0.8)\cdot 0.2 + f(1)\cdot 0.2. \]


Again, we interpret $f(s_i)\Delta x_i$ as the area of a rectangle of height $f(s_i)$ and width $\Delta x_i$. Now,
however, the corresponding rectangles are above the line y = 0.01(1 + x). Thus, the entire right
Riemann sum is equal to the total area of all five of the superscribed rectangles in Figure 2.5.


In order to compare the left and right Riemann sums visually, we have given both collections
of relevant rectangles in Figure 2.6.


Figure 2.5: The right Riemann sum. Figure 2.6: Combined Riemann sums.


What happens when we refine our partition into 10 subintervals of equal length, and look at
both the left and right Riemann sums in terms of area of rectangles? We obtain the collections
of inscribed and circumscribed rectangles in Figure 2.7.


Figure 2.7: The rectangles for the refined Riemann sums.


As you can see, the total areas of the left and right rectangles have gotten closer together,
that is, the difference between the area of the circumscribed rectangles and the area of the
inscribed rectangles has gotten smaller; both areas have gotten closer to being the actual area of
the trapezoid under the graph of f(x) = 0.01(1 + x), and above the interval [0,1] in the x-axis.


Therefore, we see that, as we refine our partition, the total area of the rectangles
representing the Riemann sums approaches the actual area under the graph and
above the closed interval on the x-axis.

Of course, we know how to calculate the area of a trapezoid; it’s one half the sum of the
lengths of the bases times the height, where the bases are the two parallel sides. Thus, the area
of our trapezoid is

\[ \frac{1}{2}\bigl(f(0) + f(1)\bigr)\cdot 1 = \frac{1}{2}(0.01 + 0.02) = 0.015, \]

which agrees with what we found for the limit of our Riemann sums in Example 2.2.8.


We have looked at the Riemann sums graphically only for the left and right Riemann sums.
What about the midpoint (Figure 2.8) Riemann sums? What about Riemann sums with
arbitrary sample sets?


Figure 2.8: The midpoint Riemann sum.


Of course, we can’t sketch rectangles corresponding to every choice that you might make
for sample points. However, it is true that, since our f(x) is continuous, all Riemann
sums, with arbitrary sample sets, approach the same value, so long as the
meshes of the partitions approach zero; that “same value”, for a non-negative function, is
the area under the graph and above the closed interval under consideration. This follows from
the definition of the definite integral and Theorem 2.3.8.


In fact, for f(x) = 0.01(1 + x), it is easy to see that, since the left and right Riemann
sums approach the area under the graph (as we take partitions with arbitrarily small mesh),
so must Riemann sums using any sample sets whatsoever. Why is it easy to see this? Because
f(x) = 0.01(1 + x) is monotonically increasing, which implies that the smallest that f ever gets
on a closed subinterval is at the left endpoint of the subinterval, and the largest that f ever gets on
a closed subinterval is at the right endpoint of the subinterval.


Figure 2.9: Left, right, and midpoint Riemann sums.


Thus, for all sets of sample points for a given partition, the rectangles representing the
Riemann sums are “trapped” between the left and right Riemann sum rectangles. See Figure 2.9,
where we have included the midpoint Riemann sum, with our previous partition of the interval
[0, 1] into 5 subintervals of equal length.


You may be thinking: “Ah, so, we can calculate limits of Riemann sums simply by calculating
areas, like that of a trapezoid. This is easy.” The problem is that we don’t know the areas under
the graphs of functions, at least, not by applying easy formulas from basic geometry.


Suppose, for instance, that our function f has been given, not by a function whose graph is
a straight line, but rather by

\[ f(x) = 0.01(1 + x^2). \]

Then, exactly how do we calculate the area under the graph of this f and over the interval [0, 1]?


Figure 2.10: Area under a parabola.


The answer is that we calculate the limit of Riemann sums to find the area, not the other
way around. Only in very few cases can we calculate the limit of Riemann sums from known
formulas for the area.
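For the parabola there is no elementary area formula to lean on, but the Riemann sums themselves can still be computed. The following Python sketch (function names are illustrative assumptions) prints left and right sums on uniform partitions. The comparison value 0.01(4/3) is not derived in the text above; it is what later integration techniques give, and it appears here only as a reference point.

```python
def g(x):
    """The parabola-based function f(x) = 0.01(1 + x^2) from the text."""
    return 0.01 * (1 + x ** 2)


def uniform_sums(func, a, b, n):
    """Return (left, right) Riemann sums of func over n equal subintervals of [a, b]."""
    dx = (b - a) / n
    left = sum(func(a + i * dx) for i in range(n)) * dx
    right = sum(func(a + (i + 1) * dx) for i in range(n)) * dx
    return left, right


reference = 0.01 * 4 / 3  # assumed reference value, not computed in this section

for n in (5, 50, 500, 5000):
    left, right = uniform_sums(g, 0.0, 1.0, n)
    print(n, left, right, right - left)
```

Because this g is increasing on [0, 1], the left sums sit below the area and the right sums above it, and the gap between them shrinks like 0.01/n.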


Before we leave this example, we should comment on how we would have graphically
represented the Riemann sums had f(x) been negative, or sometimes positive and sometimes negative.


Suppose that we have a continuous function f such that f(x) < 0 for all x in the interval
[0,1]. Then, given a partition $P = \{x_0, \ldots, x_n\}$ of [0,1], and a sample set $S = \{s_1, \ldots, s_n\}$ for
P, the summand $f(s_i)\Delta x_i$ in the Riemann sum $R_P^S(f)$ cannot be represented by area, since it
will be a negative quantity. However, we can consider a rectangle under the subinterval $[x_{i-1}, x_i]$
of height $-f(s_i)$, and then $f(s_i)\Delta x_i$ will be equal to negative the area of this rectangle beneath
the x-axis. See Figure 2.11.




Thus, the limit of the Riemann sums of our negative function f, as the mesh of the partitions
approaches zero, will be equal to negative the area above the graph and under the
interval [0, 1].


Figure 2.11: A graph below the x-axis (each rectangle has height $-f(s_i)$).


Finally, what if f is continuous, but is negative sometimes and positive other times? As we
shall see in Theorem 2.3.16, the limit of Riemann sums of such an f is a sum of contributions
from where f > 0 and where f < 0 and, thus, in terms of area, the limit of Riemann sums, as
the mesh of the partitions approaches zero, is equal to the area under the graph and above
the x-axis minus the area above the graph and under the x-axis.


Thus, for a function f such as that in Figure 2.12, the limit of $R_P^S(f)$, as the mesh of the
partition P of [0,1] approaches zero, will equal the difference of areas $A_2 - A_1$.


In no way are we claiming that the area under the x-axis and above the graph is negative;
we are simply saying that the contribution to the limit of the Riemann sums from that
portion of the graph is $-A_1$, where $A_1$ itself is positive.
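A small numerical check of this bookkeeping is sketched below in Python. The function h(x) = x - 1/4 is a hypothetical example chosen for illustration (it is not the function in Figure 2.12); it is negative on [0, 1/4) and positive on (1/4, 1], so both regions are triangles whose areas are known from geometry.

```python
from fractions import Fraction


def h(x):
    """Hypothetical sign-changing function: negative left of 1/4, positive right of it."""
    return x - Fraction(1, 4)


def midpoint_sum(func, a, b, n):
    """Midpoint Riemann sum of func over n equal subintervals of [a, b]."""
    dx = Fraction(b - a, n)
    return sum(func(a + (i + Fraction(1, 2)) * dx) * dx for i in range(n))


# Triangle areas from basic geometry (both are positive numbers).
area_below = Fraction(1, 2) * Fraction(1, 4) * Fraction(1, 4)  # plays the role of A_1
area_above = Fraction(1, 2) * Fraction(3, 4) * Fraction(3, 4)  # plays the role of A_2

print(midpoint_sum(h, 0, 1, 4), area_above - area_below)
```

Both printed values agree: the Riemann sum equals the area above the axis minus the area below it, while each area on its own is positive.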


Figure 2.12: A graph below and above the x-axis.




2.2.1 Exercises 

In each of Exercises 1 through 5, calculate the mesh of the given partition.

1. P = {-2, 0, 3, 5, 7, 9}.

2. P = {ln 1, ln 2, ln 3, ln 4, ln 5}.

3. P = {0, 1/4, 1/3, 1/2}.

4. P = {0, 1/2, 2/3, 3/4, 4/5}.


5. $P = \{0, 1/2^n, 2/2^n, \ldots, (2^n - 1)/2^n, 1\}$, where n is a positive integer.


6. Explain why $x_0 = 0$, $x_1 = 2$, $x_2 = -1$, $x_3 = 3$, and $x_4 = 4$ cannot be used as a partition.


For the following true/false questions, in Exercises 7 through 13, assume all
partitions mentioned are on the same interval, [0, 1].


7. If $\|Q\| < \|P\|$, then Q is a refinement of P.

8. If Q is a refinement of P, then $\|Q\| < \|P\|$.

9. Given a specific mesh $\epsilon < 1$, where $\epsilon$ is a rational number, there exists a partition P of
[0, 1], which has mesh $\epsilon$, where every point of P is a rational number.

10. If (Q,T) is a refinement of (P,S), then P is a refinement of Q.

11. If P is a refinement of Q, then (Q,T) is a refinement of (P,S).

12. The sampled points of a partition must all be distinct.

13. If f is a continuous function and P is a partition of some interval, then $R_P^M(f)$ is always
between $R_P^L(f)$ and $R_P^R(f)$.


In each of Exercises 14 through 16, say (a) whether Q is a refinement of partition
P and (b) whether (Q,T) is a refinement of (P,S).


14. P = {0, 1/4, 1/2, 1}, Q = {0, 1/2, 1}, S = {1/8, 2/3, 3/4}, T = {1/8, 3/4}.

15. P = {0, 2/5, 1}, Q = {0, 1/3, 2/5, 3/5, 1}, S = {1/3, 2/3}, T = {1/4, 1/3, 2/3, 3/4}.

16. P = {0, 1/8, 2/5, 3/4, 1}, Q = {0, 1/8, 3/13, 2/5, 5/9, 3/4, 4/5, 1}, S = {1/16, 2/5, 5/9, 3/4},
T = {1/7, 4/21, 5/18, 1/2, 8/13, 7/9, 9/10}.


17. Suppose a car is traveling with variable velocity for three minutes, and that

• at some point between 0 and 40 seconds, the car is moving with velocity 15 m/s;
• at some point between 40 and 90 seconds, the car is moving with velocity 20 m/s;
• at some point between 90 and 120 seconds, the car is moving with velocity 25 m/s;
• at some point between 120 and 180 seconds, the car is moving with velocity 15 m/s.

Use Riemann sums to approximate the total displacement of the car from its initial position.

18. Suppose a car is traveling with variable velocity for one hour, and that

• at some point between 0 and 12 minutes, the car is moving with velocity 40 miles/hour;
• at some point between 12 and 30 minutes, the car is moving with velocity -10 miles/hour;
• at some point between 30 and 40 minutes, the car is moving with velocity 20 miles/hour;
• at some point between 40 and 60 minutes, the car is moving with velocity 0 miles/hour.

Use Riemann sums to approximate the total displacement of the car from its initial position.

19. Suppose a car is traveling with variable acceleration for 30 minutes, and that

• at some point between 0 and 10 minutes, the car is accelerating at a rate of 5 miles
per hour per hour;
• at some point between 10 and 15 minutes, the car is accelerating at a rate of -3 miles
per hour per hour;
• at some point between 15 and 25 minutes, the car is accelerating at a rate of 1 mile
per hour per hour;
• at some point between 25 and 30 minutes, the car is accelerating at a rate of 4 miles
per hour per hour.

Use Riemann sums to approximate the total change in velocity of the car from its initial
velocity.


20. Suppose that a balloon starts with no air in it, and is inflated between t = 0 and t = 90
seconds. Suppose further that,

• at some time between 0 and 20 seconds, the balloon is inflating at a rate of 2 in³/s.
• at some time between 20 and 50 seconds, the balloon is inflating at a rate of 5 in³/s.
• at some time between 50 and 75 seconds, the balloon is inflating at a rate of -3 in³/s.
Perhaps someone lost their grip on the balloon momentarily.
• at some time between 75 and 90 seconds, the balloon is inflating at a rate of 4 in³/s.

Use Riemann sums to approximate the total volume of air in the balloon at t = 90 seconds.

21. If f(t) = c on the interval [a,b], prove that the Riemann sum of f on [a,b] using any
sample set and any partition is c(b - a). Give a physical interpretation of this fact if f is
a velocity function.

22. Let f(x) = mx be a line through the origin defined on [0,k]. Assume m > 0. What is the
limiting Riemann sum of this function over this interval? Use the fact that Riemann sums
approximate the area under a curve. Your answer may be non-rigorous.

23. Suppose f is a continuous function on [a, b]. Then f achieves its minimum m and maximum
M. Let P be a partition of [a,b]. Prove that $m(b - a) \le R_P^S(f) \le M(b - a)$.

24. Consider the following function with domain [0, 1].

\[ f(x) = \begin{cases} 1, & \text{if } x \text{ is rational;} \\ 0, & \text{if } x \text{ is irrational.} \end{cases} \]

Show that no matter what partition is chosen, and no matter how small its mesh, we can
always find two sets of sample points $S = \{s_i\}$ and $S' = \{s_i'\}$ for that partition such that
the Riemann sum is 1 on S and 0 on S'.


In each of Exercises 25 through 27, calculate the left Riemann sum for the given
function and partition.


25. h(x) = -x, P = {-3, -2, -1, 0, 1, 2, 3}.


: a T 1 3T 
26. g(x) =sinz, P = {0.5.5 af. 


27. 


fay jeer = (001509) 300,17). 


In each of Exercises 28 through 30, calculate the right Riemann sum for the given
function and partition.


28. $k(x) = e^{x}/2$, P = {0, ln 2, ln 3, ln 4}.

29. $l(x) = e^{-x}/2$, P = {0, ln 2, ln 3, ln 4}.

30. $j(x) = \cosh x$, P = {0, ln 2, ln 3, ln 4}. Hint: Use the previous two problems and the fact
that Riemann sums are linear, a fact you will prove in a later exercise.


In each of Exercises 31 through 34, calculate the midpoint Riemann sums for the
following functions and partitions.


a1. y(a) = coon, P={-¥,-55,5} (© 
32. r(x) = 1/a?, P = {1/10, 1/8, 1/5, 1/2, 1}. 
a3. ote) =tan > e, P= {0,1,2,2 A. 


34. $h(x) = \sin x \cos x$, $P = \{0, \pi/2, \pi, 3\pi/2, 2\pi\}$.


35. Let $f(x) = \sqrt{1 - x^2}$. Calculate the left Riemann sums for the following partitions.

a. $P_1 = \{-1, 0, 1\}$.

b. $P_2 = \{-1, -0.5, 0, 0.5, 1\}$.

c. $P_3 = \{-1, -2/3, -1/3, 0, 1/3, 2/3, 1\}$.

d. Based on geometric intuition, what is the limiting Riemann sum, as the mesh
approaches 0, using partitions of the interval [-1, 1]?


36. Suppose f(x) is a continuous odd function. What is the midpoint Riemann sum using the
partition $P = \{-a_3, -a_2, -a_1, 0, a_1, a_2, a_3\}$, where $0 < a_1 < a_2 < a_3$?


The work done on an object experiencing a constant force F, as the object travels
along the x-axis, from point a to b, is W = F(b - a) (provided that the force acts
parallel to the x-axis).


The total work done on an object experiencing a variable force F = F(x) can
be approximated by Riemann sums. Namely, suppose $P = \{x_0, x_1, \ldots, x_n\}$ is a
partition of [a,b] and that $S = \{s_1, \ldots, s_n\}$ is a sample set for the partition. Then the
approximate work done is

\[ W_P^S(F) = \sum_{i=1}^{n} F(s_i)(x_i - x_{i-1}). \]

The metric units of work are Newton-meters, or joules.


In each of Exercises 37 through 39, assume an object moves along the interval
[0,1], that the partition is P = {0, 1/4, 1/2, 3/4, 1}, and that the sample set is S =
{1/4, 1/2, 3/4, 1}; calculate $W_P^S(F)$.
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37. F(a) = 2°. 


38. $F(x) = \sin(\pi x)$.


39. F(a) = 3a. ie 


40. Let f(x) = 1/x on the interval $I_n = [1/n, 1]$, where n = 2, 3, .... Let
$P = \{1/n, 1/(n-1), \ldots, 1\}$. Calculate the left Riemann sum of f over this partition
for n = 2, 3 and 4.


41. Suppose that f and g are two continuous functions on the interval [a,b]. Let (P,S) be
a partition and sample set for this interval. Prove that the Riemann sum is linear in the
sense that

\[ R_P^S(cf + g) = c \cdot R_P^S(f) + R_P^S(g), \]

where c is any real number.


42. Consider the function, 
=) , f0<a<1; 
0, ifs =0, 


= 
8 
~ 
I 
SaaS 
RIB 
ee 
=) 
——~ 
IA 


and suppose that P is a partition of the interval [0, 1]. 


a. Show that the Riemann sums can be made arbitrarily large, i.e., for all N > 0, there
exists a sample set S such that $R_P^S(f) > N$.

b. Similarly, show that the Riemann sums can be made arbitrarily small, i.e., for all
N > 0, there exists a sample set S such that $R_P^S(f) < -N$.


43. Consider the function

\[ f(x) = \begin{cases} 3, & \text{if } 0 \le x < 1 \text{ or } 1 < x \le 2; \\ 100, & \text{if } x = 1. \end{cases} \]


a. Let P, = {0, sr, 5x, ---,2} be the uniform mesh where the distance between consecu- 
tive points is 1/2”. Calculate RZ (f), 


b. Show that lim R# (f) = 6. 


44, Let f(a) =c be a constant function on the interval [0,1] with c > 0. Let 


_ je, ifaA1/2; 
9H) = 4 spe 1/2, 
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Let P,, be the partition on [0,1] where the distance between each sample point is 1/n. 
Show that lim RZ(f) = lim RE(g) =c. 

TUF OO: n— oo 
The idea is that the Riemann sum of a function with a single discontinuity approaches the 


Riemann sum of the same function with the discontinuity removed. 


Suppose that a partition $P = \{x_0, x_1, \ldots, x_n\}$ of the interval [a,b] has been given, and
that f is continuous on [a,b]. The Extreme Value Theorem tells us that f attains
maximum and minimum values on any closed subinterval of [a, b].


Let U(P) be a sample set chosen in such a way that, for each $s_i$ in the sample
set U(P), $f(s_i)$ equals the maximum value of f on $[x_i, x_{i+1}]$. Similarly, let L(P) be a
sample set chosen in such a way that, for each $s_i$ in the sample set L(P), $f(s_i)$ equals
the minimum value of f on $[x_i, x_{i+1}]$.


45. Show that $R_P^{U(P)}(f) \ge R_P^{L(P)}(f)$.

46. Show that if Q is a refinement of P, then $R_Q^{U(Q)}(f) \le R_P^{U(P)}(f)$ and $R_Q^{L(Q)}(f) \ge R_P^{L(P)}(f)$.

47. Show that if P' is another partition, not necessarily a refinement of P or vice versa, then
$R_P^{L(P)}(f) \le R_{P'}^{U(P')}(f)$.

We can use Riemann sums to approximate lengths as well as areas. Suppose we’d
like to approximate the length of the graph of a function f(x) on the interval [a, b].
If $P = \{x_0, x_1, \ldots, x_n\}$ is a partition of the interval [a,b], then we can measure the
lengths of the line segments connecting the points $(x_{i-1}, f(x_{i-1}))$, $(x_i, f(x_i))$. Recall
that this length is given by $\sqrt{(x_i - x_{i-1})^2 + (f(x_i) - f(x_{i-1}))^2}$.
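The segment-length idea translates directly into code. Below is a minimal Python sketch; the function name `polygonal_length` and the test function (a straight line from (0, 0) to (3, 4), whose length is 5 by the Pythagorean theorem) are illustrative assumptions chosen so that the answer is known in advance.

```python
import math


def polygonal_length(func, partition):
    """Total length of the segments joining (x_{i-1}, f(x_{i-1})) to (x_i, f(x_i))."""
    pts = sorted(partition)
    return sum(math.hypot(x1 - x0, func(x1) - func(x0))
               for x0, x1 in zip(pts, pts[1:]))


def line(x):
    """Hypothetical test function: the line through (0, 0) and (3, 4)."""
    return 4 * x / 3


coarse = polygonal_length(line, [0, 3])
fine = polygonal_length(line, [0, 1, 2.5, 3])
print(coarse, fine)
```

For a straight line every partition gives the same length, up to floating-point rounding; for a curved graph, refining the partition can only lengthen the polygonal path.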


48. Let $f(x) = \sqrt{1 - x^2}$.


a. Consider the partition $P_1 = \{-1, 0, 1\}$ of [-1, 1]. Approximate the length of the
graph by calculating

\[ \sum_{i=1}^{2} \sqrt{(x_i - x_{i-1})^2 + (f(x_i) - f(x_{i-1}))^2}. \]

b. Approximate the length of the graph, via the same technique, but using the refined
partition $P_2 = \{-1, -1/2, 0, 1/2, 1\}$.


c. Based on classical geometry, what is the length of this graph? Are your
approximations close?


In each of Exercises 49 through 51, use the technique in the previous problem to
approximate the lengths of the graphs over the specified interval using the given
partition.

49. $g(x) = \sin x$, $[0, \pi]$, $P = \{0, \pi/4, \pi/2, 3\pi/4, \pi\}$.
50. A(z) =27,:[0,.1),.P =A0, 1/4, 1/233/74,11. 
51. j(x) = 1/x, [1, 5], P = {1, 2, 3, 4, 5}.

52. Given a partition $\{x_0, \ldots, x_n\}$ of [a,b], we’ve been using

\[ \sum_{i=1}^{n} \sqrt{(x_i - x_{i-1})^2 + (f(x_i) - f(x_{i-1}))^2} \]

to approximate the length of the graph of f on the interval [a, b]. This doesn’t look like a
Riemann sum, but we can “fix” it.


Prove that, if f is differentiable on an open interval containing [a,b], and no two $x_i$’s are
equal, then there exists $s_i$ in the open interval $(x_{i-1}, x_i)$ such that

\[ \sum_{i=1}^{n} \sqrt{(x_i - x_{i-1})^2 + (f(x_i) - f(x_{i-1}))^2} = \sum_{i=1}^{n} \left(\sqrt{1 + [f'(s_i)]^2}\right) \Delta x_i. \]


2.3. The Definite Integral


As we saw in the previous section, the process of taking Riemann sums, and using partitions 
with meshes that get arbitrarily small, arises in a number of different contexts. We might be 
given the velocity of an object over an interval of time, and want to find the displacement. We 
might be given the rate at which a balloon is being inflated over an interval of time, and want to 
know how much the volume of air in the balloon changes. We might be given a rod of variable 
density, and want to know its total mass. We might want to know how much area is “trapped”
between the graph of a function and the x-axis.


In this section, we define the definite integral as the limit of Riemann sums, and discuss some 
of the basic properties of definite integrals. We do not give many applications of definite integrals 
here; such applications will be the sole topic of the entire next chapter, Chapter 3. It is also
somewhat difficult to look at serious applications before we have the basic tool for calculating
definite integrals, the second part of the Fundamental Theorem of Calculus, Theorem 2.4.10.


Many of the proofs in this section are extremely technical and we have deferred them to the
Technical Matters section.


In Example 2.2.8 and Example 2.2.9 in the previous section, we saw that our Riemann sums
$\sum_{i=1}^{n} f(s_i)\Delta x_i$ approached a limit as we let the mesh of the partition approach zero, but we
used special partitions and sample sets; we used subintervals which all had the same length, and
we used the left and right Riemann sums. What we would like to know is that other manners of
choosing partitions and sample sets would have approximated the same quantities, that is, we
would like to know that we could have used arbitrary partitions and arbitrary sample sets, and
still obtained the same limit, so long as the meshes of the partitions approach zero. Technically,
this means that we would like for there to exist a limit L that our Riemann sums get arbitrarily
close to (within $\epsilon$, for arbitrary $\epsilon > 0$) if we make the mesh of our partitions small enough (less
than some $\delta > 0$, where $\delta$ would typically depend on how $\epsilon$ was chosen).


Thus, we make the definition below, though it should not be clear at this point that
many/any functions satisfy the strong requirements.


Definition 2.3.1. (The Definite Integral) Suppose that f is defined on the closed interval
[a,b], where a < b, and that there exists a real number L such that, for all $\epsilon > 0$, there exists
$\delta > 0$ such that, for all partitions $P = \{x_0, \ldots, x_n\}$ of [a,b], with mesh less than $\delta$, and for
all sample sets $S = \{s_1, \ldots, s_n\}$ for P,

\[ \left| \sum_{i=1}^{n} f(s_i)\Delta x_i - L \right| < \epsilon. \]

Then, we write that

\[ \lim_{\|P\| \to 0} \sum_{i=1}^{n} f(s_i)\Delta x_i = L, \]

and we say that f is Riemann integrable on [a, b].


When f is Riemann integrable, the limit above is usually written

\[ \int_a^b f(x)\,dx \]

and is called the (definite) integral of f on [a,b], or the integral of f(x), with respect
to x, as x goes from a to b.


In this context, f is referred to as the integrand, and a and b are the limits of integration.
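The definition allows arbitrary partitions and arbitrary sample points. The Python sketch below illustrates this numerically for f(x) = 0.01(1 + x) on [0, 1], using randomly chosen partitions and sample points and comparing against the value 0.015 found in Example 2.2.8. The helper names and the fixed random seed are assumptions made for illustration, and a numerical experiment of this kind is not a proof of integrability.

```python
import random


def f(x):
    """The function from Example 2.2.8 (units omitted)."""
    return 0.01 * (1 + x)


def random_sampled_partition(a, b, n, rng):
    """Return (points, samples): n subintervals of [a, b] with random breakpoints and samples."""
    interior = sorted(rng.uniform(a, b) for _ in range(n - 1))
    pts = [a] + interior + [b]
    samples = [rng.uniform(lo, hi) for lo, hi in zip(pts, pts[1:])]
    return pts, samples


def riemann_sum(func, pts, samples):
    """Sum of func(s_i) * (x_i - x_{i-1})."""
    return sum(func(s) * (hi - lo) for s, lo, hi in zip(samples, pts, pts[1:]))


def mesh(pts):
    """Length of the longest subinterval."""
    return max(hi - lo for lo, hi in zip(pts, pts[1:]))


rng = random.Random(0)  # fixed seed so that runs are repeatable
results = []
for n in (10, 100, 1000, 10000):
    pts, samples = random_sampled_partition(0.0, 1.0, n, rng)
    results.append((mesh(pts), riemann_sum(f, pts, samples)))
    print(n, results[-1])
```

For this particular f, whose values change by at most 0.01 times the distance moved, each sum differs from 0.015 by no more than 0.01 times the mesh, so the sums are forced toward 0.015 as the mesh shrinks, however the points are chosen.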


Remark 2.3.2. There are several points that we need to make now.


First, you should understand the point of the definition of the integral. A function is Riemann
integrable if and only if the Riemann sums $\sum_{i=1}^{n} f(s_i)\Delta x_i$ approach a specific limit, regardless
of how you choose the partitions and/or sample points, as long as you take small enough
subintervals in your partitions. You are allowed to use partitions in which the subintervals have
different lengths and choose sample points randomly. For f to be Riemann integrable means
that none of this matters, as long as the mesh of the partitions approaches 0.


It’s the strongest property that you ask for, given our discussion in the previous section, and
it’s a little difficult to believe that many interesting functions satisfy such a strong condition,
but we shall see that most of the functions with which you’re familiar are, in fact, Riemann
integrable. (We are deliberately leaving you in suspense for the moment about which functions
those are.)


We should also mention that it is customary in Calculus textbooks to drop the modifier
“Riemann” from the term “Riemann integrable”, and simply say that a function is “integrable”,
whenever they mean “Riemann integrable.” The justification given for this is always that
“Riemann integrability is the only type of integrability that will be used in this textbook.”
Strangely, this is not usually the case; every, or almost every, Calculus textbook includes a
discussion of improper integrals (see Section 2.5).


Improper integrals specifically involve defining an integral $\int_a^b f(x)\,dx$ in certain cases where
f is not Riemann integrable on [a, b].


For this reason, we shall not drop the modifier “Riemann” from “Riemann integrable.” We
shall reserve the simpler term “integrable” for functions on intervals for which the Riemann
integral exists or for which the improper integral exists.


It is important to note that the units on the integral are the units of the Riemann sums, i.e., 
the units of f times the units of x. 


Finally, you should realize that the variable x, which appears in $\int_a^b f(x)\,dx$, is a dummy
variable, which means that this variable name is irrelevant in determining the actual value
of the integral. For instance, we have the following equalities:

$$\int_a^b f(x)\,dx = \int_a^b f(t)\,dt = \int_a^b f(u)\,du.$$


All of these integrals mean the same thing: you partition the interval [a, b] into arbitrarily small 
subintervals, you evaluate f at a sample point in each subinterval, you multiply the values of f 
at the sample points times the lengths of the respective subintervals, you add, and you take the 
limit of totals that you get, as the mesh of the partitions approaches zero. The point of writing 
all of that, without referring to a variable, was to make it clear that it is irrelevant what the 


variable is. 


However, it is true that, in applications, there will typically be variables of physical significance,
like t for time or x for position, and it would, in fact, be a little odd to change the variable
name when writing integrals related to these quantities.


The fact that Definition 2.3.1 defines the Riemann integral as one single thing follows from
the standard type of proof (see Theorem 2.A.1) that limits are unique; thus, there aren’t two
(or more) different limits $L_1$ and $L_2$ that satisfy the ε-δ condition that we required. We state
this as a theorem.


Theorem 2.3.3. The limit of Riemann sums, the definite integral, given in Definition 2.3.1
is unique, if it exists.

In addition, if f is Riemann integrable on the interval [a, b], $\int_a^b f(x)\,dx = L$, and
$(P_n, S_n)$ is a sequence of sampled partitions of [a, b] such that $\lim_{n\to\infty} \|P_n\| = 0$, then

$$\lim_{n\to\infty} R_{P_n}^{S_n}(f) = L.$$


Well, this is all great, but do we know any functions which are Riemann integrable? 
For that matter, do we know any functions which aren’t Riemann integrable? How you calculate 
definite integrals, when they exist, is a separate question, and our best answer to that will have 
to wait until Section 2.4, but, for now, we’d at least like to know some results on when integrals


exist and when they don’t. 


There is an easy, fundamental, criterion which implies that a function is not Riemann 
integrable. If a function is arbitrarily large in absolute value on an interval, then the function 


will not be Riemann integrable. We give a careful definition before stating this result as a theorem.


Definition 2.3.4. A set E of real numbers is bounded if and only if there exist real
numbers p and q such that, for all x in E, $p \le x \le q$. Equivalently, E is bounded if and
only if there exists a real number M > 0 such that, for all x in E, $-M \le x \le M$, i.e.,
$|x| \le M$.


Suppose that f is a real function and that a set E is contained in the domain of f. We say 
that f is bounded on E if and only if the set of values of f on E is bounded, i.e., there 
exists M > 0 such that, for all x in E, $|f(x)| \le M$.


If f is not bounded on E, then we say that f is unbounded on E.

Example 2.3.5. As a particular case of a bounded set, note that closed, bounded intervals are
intervals of the form [a, b].


As an example of an unbounded function on a closed, bounded interval, consider 


$$f(x) = \begin{cases} 0, & \text{if } x = 0; \\ \dfrac{1}{x}, & \text{if } 0 < x \le 1. \end{cases}$$


This function is unbounded on [0,1], since f becomes arbitrarily large as x approaches 0 from 
the right. 


On the other hand, the Extreme Value Theorem (see [2], or [4]) tells us that every contin- 


uous function on a closed, bounded interval is bounded. 


The most basic way in which a function can fail to be Riemann integrable is given by: 


Theorem 2.3.6. If f is unbounded on the interval [a, b], then f is not Riemann integrable
on [a, b].


Proof. Suppose that f is unbounded on [a, b], and let $P = \{x_0, \dots, x_n\}$ be a partition of [a, b].
Then, f must be unbounded on at least one of the subintervals $[x_{i-1}, x_i]$. Therefore, by changing
the sample point $s_i$ in this subinterval, you can make $|f(s_i)\Delta x_i|$ arbitrarily large. This means
that, as the mesh of the partitions approaches 0, the Riemann sums do not approach a limit that
is independent of how the sample points are chosen, i.e., the definite integral does not exist.
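The mechanism in this proof can be seen numerically. The sketch below assumes the function that equals 0 at x = 0 and 1/x for 0 < x ≤ 1, on a fixed uniform partition of [0, 1]. The helper name `riemann_sum_with_first_sample` is hypothetical. Only the sample point in the first subinterval is moved; all other sample points are right endpoints.

```python
def f(x):
    """Equal to 0 at x = 0 and to 1/x for 0 < x <= 1."""
    return 0.0 if x == 0 else 1.0 / x

def riemann_sum_with_first_sample(n, first_sample):
    """Riemann sum of f on the uniform partition of [0, 1] into n pieces.

    Right endpoints are used as sample points, except in the first
    subinterval [0, 1/n], where the caller chooses the sample point.
    """
    dx = 1.0 / n
    samples = [first_sample] + [i * dx for i in range(2, n + 1)]
    return sum(f(s) * dx for s in samples)

n = 1000
for first_sample in (1e-3, 1e-6, 1e-9, 1e-12):
    # The partition never changes; only one sample point moves toward 0.
    print(first_sample, riemann_sum_with_first_sample(n, first_sample))
```

Since the partition is held fixed, its mesh is fixed too, yet the sums can be pushed as high as we like. That is exactly why no single limit L can work.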


Example 2.3.7. Theorem 2.3.6 tells us immediately that the function f(x) from Example 2.3.5
is not Riemann integrable.

Alright. Now we have a basic way in which functions can fail to be Riemann integrable. But,
we want a theorem that tells us that lots of functions are Riemann integrable. There is such a 
theorem. Informally, what it says is that a function on a closed, bounded interval is Riemann 
integrable if and only if the function is bounded and the set of points where the function is 
discontinuous is “small”. The technical requirement for “small” is that the set of points where 
f is discontinuous has to have measure zero. A discussion of this result, in full generality, is 
beyond the scope of the textbook. See [4] and [3]. 


However, all of the functions that we shall want to integrate in this book will either be 
continuous, or have a finite number of discontinuities; the good news is that the empty set and 


finite sets of points definitely have measure zero. 


Thus, we have: 


Theorem 2.3.8. Bounded functions on closed bounded intervals, which have, at most, a 
finite number of discontinuities, are Riemann integrable. 


In particular, as continuous functions on closed, bounded intervals are bounded, all con- 
tinuous functions on closed, bounded intervals are Riemann integrable. 


Proof. We give the proof in Theorem 


It is convenient to give a name to functions which may have, at most, a finite number of 


discontinuities. 


Definition 2.3.9. A real function f is piecewise-continuous on an interval I provided 


that f is defined on I and is continuous at all, except (possibly) a finite number of, points 
in I. In particular, a continuous function is also piecewise-continuous. 


Example 2.3.10. The function

$$g(x) = \begin{cases} 0, & \text{if } 0 \le x \le 0.1; \\ \dfrac{1}{x}, & \text{if } 0.1 < x \le 1 \end{cases}$$

is piecewise-continuous and, hence, Theorem 2.3.8 tells us that g(x) is Riemann integrable.

It is interesting to compare this with the function f(x) from Example 2.3.5:

$$f(x) = \begin{cases} 0, & \text{if } x = 0; \\ \dfrac{1}{x}, & \text{if } 0 < x \le 1. \end{cases}$$

Figure 2.13: Graph of y = f(x). Figure 2.14: Graph of y = g(x).


While the graphs of f and g may appear to be similar (or maybe they don’t to you), it is 
important to keep in mind that unbounded functions are very different from bounded piecewise- 


continuous functions. 


Before stating more theorems, we should translate our discussion about area in Example 2.2.9
into a proposition about definite integrals, a proposition which tells us how to visualize integrals 
in terms of area. This proposition could, instead, be used as a definition for area below or 
above graphs of certain types of functions; we choose to believe that you have a preconceived 
notion of area, and that the following proposition tells you rigorously what that area equals in 


terms of limits of Riemann sums, i.e., in terms of definite integrals.

Proposition 2.3.11. Suppose that a < b, and f is Riemann integrable on [a, b].

1. If $f \ge 0$ on [a, b], then $\int_a^b f(x)\,dx$ is equal to the area under the graph of y = f(x) and
above the interval [a, b].

2. If $f \le 0$ on [a, b], then $-\int_a^b f(x)\,dx = \int_a^b |f(x)|\,dx$ is equal to the area under the
interval [a, b] and above the graph of y = f(x); thus, $\int_a^b f(x)\,dx$ is equal to negative
the area under the interval [a, b] and above the graph of y = f(x).


Figure 2.15: Integral of a positive function is Figure 2.16: Integral of a negative function is 
area under the graph. negative the area above the graph. 


Be careful: the area under the x-axis and above the graph is NOT, itself, negative; it’s
positive, as area always is. The point is that, to interpret the integral of a negative function 
in terms of area, you negate the positive area above the graph, which, of course, yields a 
negative number. 


A special case of Proposition 2.3.11 is when y = f(x) = k, where k is a constant.


Proposition 2.3.12. Suppose that a < b, and k is a constant. Then, the function f(x) =k 
is Riemann integrable and 


$$\int_a^b k\,dx = k(b-a).$$

Proof. Of course, constant functions are continuous and, hence, Riemann integrable by Theorem 2.3.8.
However, our proof of the formula will also prove that constant functions are Riemann
integrable.


We shall simply show that every Riemann sum of f(x) = k, regardless of the partition P
and the sample set S, has $R_P^S(f)$ equal to precisely $k(b-a)$. Thus, $L = k(b-a)$ clearly satisfies
the conditions for L in Definition 2.3.1.

Let $P = \{x_0, x_1, \dots, x_n\}$ be a partition of [a, b], and let $S = \{s_1, \dots, s_n\}$ be a sample set for
P. Then,

$$R_P^S(f) = \sum_{i=1}^n f(s_i)\Delta x_i = \sum_{i=1}^n k\,\Delta x_i = k\sum_{i=1}^n \Delta x_i,$$

and this last summation telescopes to yield $x_n - x_0 = b - a$.


Figure 2.17: Area under y = k > 0 (Area = k(b-a)). Figure 2.18: Area above y = k < 0 (Area = (-k)(b-a)).


Remark 2.3.13. The definition of the definite integral looks very technical. Thinking in terms 
of area can help you to see things in some cases, but many physical problems, which don’t involve 
areas, deal with subdividing, taking Riemann sums, and taking limits. So, more generally, you 


may be wondering how you should think about 


$$\int_a^b f(x)\,dx = \lim_{\|P\|\to 0} R_P^S(f) = \lim_{\|P\|\to 0} \sum_{i=1}^n f(s_i)\Delta x_i.$$

One possible way to think about definite integrals, a way that we shall use extensively throughout
this textbook, is informal, but is, nonetheless, extremely useful. We discuss the problems
infinitesimally.


If we take a partition with mesh less than some positive δ, then all of the $\Delta x_i$’s are less than
δ, and any x-coordinate in the interval [a, b] is within δ of one of the sample points. Thus, as
the meshes approach zero, the sample points get arbitrarily close to each point in the interval
[a, b], and the subinterval(s) containing a given point become(s) arbitrarily small.

Thus, we think, intuitively, that, if we select an x in [a, b], as the mesh of the partitions
approaches 0, and we look at $\sum_{i=1}^n f(s_i)\Delta x_i$, in the limit, one of the summands “becomes” f(x)
times an infinitesimal change in x, represented by dx. That is, in the limit, we think that, for
each x in the interval [a, b], one of the $f(s_i)\Delta x_i$ approaches the infinitesimal summand f(x) dx,
and the summation $\sum_{i=1}^n$ becomes the “continuous summation” $\int_a^b$.


While we always keep in the backs of our minds that we are really using partitions, sample 
points, Riemann sums, and taking limits, in many physical applications it is very intuitive, and 
time-saving, to think of the definite integral as the continuous sum of infinitesimal 


contributions, and to analyze problems using this informal terminology. 


For instance, we arrived at Proposition 2.3.11 by looking back at our discussion in Example 2.2.9
and using the definition of the definite integral in Definition 2.3.1. Thus, we consider
area as a limit of Riemann sums. But now, let’s discuss area in terms of continuous sums of
infinitesimal contributions. Hopefully, this will seem more intuitive to you, but keep in mind
that the real definition of the integral is the limit of Riemann sums.


Consider a function f(x), defined on the closed interval [a, b], where a < b, and, for now,
assume that $f(x) \ge 0$, for all x in [a, b]. If we assume that f is Riemann integrable on [a, b],
how do we picture $\int_a^b f(x)\,dx$?


Figure 2.19: A rectangle with infinitesimal width and area. Figure 2.20: A magnified infinitesimal rectangle.

Pick some x-coordinate between a and b, and draw a rectangle of height f(x) and a very
small width, which we think of as the infinitesimal dx; the infinitesimal subinterval of width
dx should include x. See Figure 2.19 and Figure 2.20, which show the type of thin rectangle
that you might draw for yourself, in order to represent a rectangle of infinitesimal width and
area, together with a “fancier” illustration of a really (think: “infinitesimally”) thin rectangle
being magnified. Intuitively, we think “the only x-coordinate in the infinitesimal subinterval is
x.” Why? Because if there were some other x-coordinate, say $x'$, in the subinterval, then the
subinterval would have to have length equal to at least $|x - x'|$, and so would not be infinitesimal.
Our infinitesimally wide rectangle has infinitesimal area dA, which equals the height times the
width, i.e., dA = f(x) dx.


Now, to obtain the total area under the graph of f and above the interval [a, b] on the x-axis,
we simply take the continuous sum of all of the infinitesimal areas as x goes from a to b. Thus,
the total area is

$$\int_{x=a}^{x=b} dA = \int_a^b f(x)\,dx.$$

Note that, on the first integral above, we had to include “x =” in the limits of integration;
by default, the limits of integration refer to the variable that you have d of, that is, the
variable in the differential. So, if we had not included “x =” in the limits of integration in
$\int_{x=a}^{x=b} dA$, then it would have meant that A was going from a to b, not x. As we have dx
in the righthand integral above, we do not need to explicitly state in that integral that the
limits of integration refer to x.


Understand — we are not claiming that we have given actual definitions that make what we 
have written mean anything rigorous; we are merely trying to get you used to this manner of 


intuitive thinking, in terms of continuous sums of infinitesimal contributions. 


Of course, this is precisely what we concluded in Example 2.2.9, except there we didn’t call
it the integral; we simply said the area under the graph was the limit of the Riemann sums. 
That’s what we are saying here too, it’s just that we have new terminology, the definite integral, 
together with our new notation for the integral, and instead of explicitly adding up Riemann 


sums, we talk about taking a continuous sum of infinitesimal contributions. 


What do we do when $f(x) \le 0$ (and is still Riemann integrable)? We draw a picture like
that in Figure 2.19, except that now the infinitesimal rectangle lies under the x-axis and above
the graph; thus, the rectangle looks like that in Figure [2.1]| except that now we think of the 
width as the infinitesimal dx and the height is — f(x). Therefore, the total area A between the 
x-axis and the graph of y = f(x) is 


$$A = \int_{x=a}^{x=b} dA = \int_a^b -f(x)\,dx = -\int_a^b f(x)\,dx,$$

and, hence, $\int_a^b f(x)\,dx = -A$. (Note that we “cheated” a bit here, and used that we could “pull
out” the negative sign in the integral; we won’t really know that this is “legal” until we have
Theorem 2.3.19.)


Finally, what do we do when f is Riemann integrable, and $f(x) \ge 0$ on some (finite number
of) closed subintervals in [a, b], and $f(x) \le 0$ on (a finite number of) other closed subintervals
in [a, b]? In this case, as we discussed at the end of Example 2.2.9, $\int_a^b f(x)\,dx$ is a sum of
the contributions from where $f \ge 0$ and where $f \le 0$, which means that $\int_a^b f(x)\,dx$ can be
interpreted as the area under the graph of y = f(x) and above the interval [a, b] minus the area
above the graph and under the interval [a, b]. (Again, we “cheated” a bit here, and used that we
could split the integral over [a, b] into the sum of integrals over subintervals whose union is [a, b];
we won’t really know that this is “legal” until we have Theorem 2.3.16 and/or Theorem 2.3.18.)


We now need to state a number of theorems about definite integrals. Despite the fact that
all of the examples of Riemann integrable functions that we shall look at will be bounded, 
piecewise-continuous functions, the remaining theorems of this section are true for completely 
general Riemann integrable functions, and it’s just as easy to state and prove them in that 


generality; so we will. 


The following theorem says that, as far as integration is concerned, you can “ignore” what 


happens at a finite set of points in the domain. We give the proof in the Technical Matters 


section; see Theorem 


Theorem 2.3.14. Suppose that f and g are defined on a closed interval [a, b], and that,
except possibly for a finite set of points in [a, b], f and g are equal at each point in [a, b].
Then, f is Riemann integrable on [a, b] if and only if g is, and when f and g are Riemann
integrable,

$$\int_a^b f(x)\,dx = \int_a^b g(x)\,dx.$$


Example 2.3.15. The function given by

$$f(x) = \begin{cases} 7, & \text{if } 1 \le x < 3 \text{ or } 3 < x \le 5; \\ 4, & \text{if } x = 3 \end{cases}$$

is equal to the function that is constantly 7 on the interval [1, 5], except at the point x = 3.

Thus, by Theorem 2.3.14,

$$\int_1^5 f(x)\,dx = \int_1^5 7\,dx = 7(5-1) = 28,$$

where the next-to-last equality follows from Proposition 2.3.12.
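As a quick numerical illustration of this example, the following sketch computes right Riemann sums on uniform partitions of [1, 5]. The helper name `right_riemann_sum` and the particular values of n are our own choices. Whenever a sample point lands exactly on x = 3, the sum is off from 28 by 3Δx, which shrinks with the mesh.

```python
def f(x):
    """Equal to 7 on [1, 5], except at x = 3, where the value is 4."""
    return 4.0 if x == 3 else 7.0

def right_riemann_sum(f, a, b, n):
    """Right Riemann sum of f on the uniform partition of [a, b] into n pieces."""
    dx = (b - a) / n
    return sum(f(a + i * dx) * dx for i in range(1, n + 1))

for n in (4, 40, 400, 4000):
    print(n, right_riemann_sum(f, 1.0, 5.0, n))
```

The single exceptional point can contribute to at most one summand, and that summand has width Δx, so its effect disappears in the limit.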


In terms of area, it seems reasonable that what happens at a finite number of points shouldn’t 
affect the integral, since the area under (or over) a single point and above (or below, respectively) 


the x-axis should be zero, and so does not change the area being considered. See Figure 2.21.


Figure 2.21: Area under a point is zero. 


The following theorem is about subdividing, or splitting, the interval over which you’re 
integrating. Thinking in terms of continuous sums, Item 1 of the theorem says that, if the 
continuous sum on a bigger interval is defined, then so is the continuous sum on any smaller 
subinterval; Item 2 simply says that the sum of the infinitesimal contributions over all x in 
the interval [a, b] is equal to the sum of those contributions as x goes from a to some intermediate
x-coordinate c plus the sum of the contributions as x goes on from c to b.

Theorem 2.3.16. Suppose that a < b.

1. If f is Riemann integrable on [a, b] and $a \le c < d \le b$, then f is Riemann integrable on
[c, d], i.e., if f is Riemann integrable on a given closed interval, then f is Riemann integrable
on any closed subinterval of the given interval.

2. Suppose that a < c < b. Then, f is Riemann integrable on [a, b] if and only if f is
Riemann integrable on [a, c] and [c, b] and, when these equivalent conditions hold,

$$\int_a^b f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx.$$

Proof. See Theorem 2.A.6 in the Technical Matters section, Section 2.A.


For a non-negative, Riemann integrable function f, Item 2 of Theorem 2.3.16 is easy to
picture in terms of area; see Figure 2.22.

Figure 2.22: Area over [a, b] equals area over [a, c] plus area over [c, b].


You may wonder about counting the line segment at x = c once in $\int_a^b f(x)\,dx$ versus twice
in the sum of integrals from a to c and from c to b, but, remember: the area of a line segment
is 0, and adding it or subtracting it has no effect on the area calculation.

The formula in Item 2 of Theorem 2.3.16 is so useful that we would like for it to be true
regardless of what inequalities (or equalities) are satisfied by a, b, and c. For instance, we would
like for the formula to hold if a = c. This means that we must have

$$\int_a^b f(x)\,dx = \int_a^a f(x)\,dx + \int_a^b f(x)\,dx,$$

and so, we would need for $\int_a^a f(x)\,dx$ to equal 0.

Also, if a = b, we should have

$$\int_a^a f(x)\,dx = \int_a^c f(x)\,dx + \int_c^a f(x)\,dx.$$

If $\int_a^a f(x)\,dx = 0$, then we need for $\int_c^a f(x)\,dx$ to equal $-\int_a^c f(x)\,dx$.


Therefore, we make the following definitions: 


Definition 2.3.17. If a is in the domain of f, we say that f is Riemann integrable on the
interval [a, a] and define

$$\int_a^a f(x)\,dx = 0.$$

If a < b, and f is Riemann integrable on [a, b], then we define

$$\int_b^a f(x)\,dx = -\int_a^b f(x)\,dx.$$


With the definitions above, and given Theorem 2.3.16, it is easy to check that

Theorem 2.3.18. Suppose that f is Riemann integrable on a closed interval containing a,
b, and c. Then,

$$\int_a^b f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx,$$

regardless of the order of a, b, and c.
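The sign conventions of Definition 2.3.17 are easy to encode, and doing so lets us spot-check the formula for several orderings of a, b, and c. The sketch below is only a numerical illustration for the integrand f(x) = x. The function name `integral_of_x` and the use of midpoint sample points are our own assumptions.

```python
def integral_of_x(a, b, n=10_000):
    """Approximate the integral of f(x) = x from a to b, allowing any order of a and b.

    Follows Definition 2.3.17: the value is 0 when a == b, and minus the
    integral over [b, a] when b < a. For a < b, a midpoint Riemann sum on a
    uniform partition is used.
    """
    if a == b:
        return 0.0
    if b < a:
        return -integral_of_x(b, a, n)
    dx = (b - a) / n
    return sum((a + (i - 0.5) * dx) * dx for i in range(1, n + 1))

# Compare the two sides of the formula for several orderings of a, b, c.
for a, b, c in [(0, 2, 1), (0, 1, 2), (2, 0, 1), (1, 1, 3)]:
    lhs = integral_of_x(a, b)
    rhs = integral_of_x(a, c) + integral_of_x(c, b)
    print((a, b, c), lhs, rhs)
```

In each row the two printed values should agree up to floating-point rounding, including the rows where c lies outside the interval between a and b.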
The following two theorems describe fundamental properties of integration, and are true
essentially because the corresponding results for Riemann sums are true.


Theorem 2.3.19. (Linearity of Integration) Definite integration over a closed interval
is a linear operation, i.e., if f and g are Riemann integrable on [a, b], then, for all constants
r and s, the function rf + sg is Riemann integrable on [a, b], and

$$\int_a^b \big(r f(x) + s g(x)\big)\,dx = r\int_a^b f(x)\,dx + s\int_a^b g(x)\,dx.$$


Proof. This proof actually splits into two distinct parts; that the integral of a sum is the sum 
of the integrals, and that you can move constants, multiplied in the integrand, outside of the 


integral. 


The statement about constant multiples is easy, and follows from the fact that constants
distribute over Riemann sums; we leave this part of the proof as an exercise, Exercise 52. The
fact that the integral of a sum is the sum of the integrals is harder; we prove this in Theorem 2.A.7
in the Technical Matters section, Section 2.A.

If you think of integrals in terms of areas, the following theorem seems fairly obvious; see
Figure 2.23.


Theorem 2.3.20. (Monotonicity of Integration) If f and g are Riemann integrable on
the interval [a, b] and, for all x in [a, b], $f(x) \le g(x)$, then

$$\int_a^b f(x)\,dx \le \int_a^b g(x)\,dx.$$

Proof. The inequality $f(x) \le g(x)$ implies that the corresponding inequality holds on the Riemann
sums for each sampled partition. As we are assuming that f and g are Riemann integrable,
it is easy to conclude that the integral of f on [a, b] is less than or equal to the integral of g. We
give the details in Theorem 
Figure 2.23: Comparing areas under the two graphs.


Theorem 2.3.21. If f is Riemann integrable on the interval [a, b], then so is |f|, and

$$\left|\int_a^b f(x)\,dx\right| \le \int_a^b |f(x)|\,dx.$$

Proof. The proof that f being Riemann integrable implies that |f| is Riemann integrable is very
technical; we refer you to Theorem 3.3.5 of [4]. Assuming this fact, the given inequality follows
as a corollary to Theorem 2.3.20 and Theorem 2.3.19.

To see this, note that $-|f(x)| \le f(x) \le |f(x)|$. Applying Theorem 2.3.20 and Theorem 2.3.19,
we obtain that $-|f|$ is Riemann integrable on [a, b] and

$$-\int_a^b |f(x)|\,dx \le \int_a^b f(x)\,dx \le \int_a^b |f(x)|\,dx.$$


The inequality in the theorem is now immediate. 


Now, consider the continuous function f(x) = x on a closed interval [a, b]. Theorem 2.3.8
tells us that

$$\int_a^b x\,dx$$

exists. Given that the integral exists, Theorem 2.3.3 tells us that we may calculate the value
of the integral by using the limit of Riemann sums over any sequence of sampled partitions, as
long as the meshes of the partitions approach zero.


So, we will let $P_n$ be the partition of [a, b] into n subintervals of equal length, i.e., letting
$\Delta x = (b-a)/n$,

$$P_n = \{a,\ a+\Delta x,\ a+2\Delta x,\ \dots,\ a+(n-1)\Delta x,\ a+n\Delta x\},$$

where $x_i = a + i\Delta x$ and, of course, $b = a + n\Delta x$.

We will use the right Riemann sums, so that our sample points are also given by $s_i = a + i\Delta x$.

Let’s denote the corresponding Riemann sum by simply $R_n$. For f(x) = x, we obtain


$$R_n = \sum_{i=1}^n f(s_i)\Delta x_i = \sum_{i=1}^n \big[(a + i\Delta x)\cdot \Delta x\big].$$

Using linearity of summations, Proposition 2.1.4, and Item a from Corollary 2.1.11, we find

$$R_n = a\Delta x\sum_{i=1}^n 1 + (\Delta x)^2\sum_{i=1}^n i = an\Delta x + (\Delta x)^2\cdot\frac{n(n+1)}{2} = a(b-a) + \frac{(b-a)^2}{2}\left(1+\frac{1}{n}\right).$$

As $n \to \infty$, $R_n \to a(b-a) + (b-a)^2/2 = (b^2-a^2)/2$.

Hence, we have shown

Proposition 2.3.22.

$$\int_a^b x\,dx = \frac{b^2-a^2}{2}.$$

Note that, even though we assumed a < b in our proof of Proposition 2.3.22, it follows from
Definition 2.3.17 that the proposition above remains true when b < a.
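The right Riemann sums for f(x) = x can also be checked numerically. In the sketch below, the endpoints a = 1 and b = 4 and the function names are our own choices. The function `closed_form` evaluates the expression obtained by simplifying the sum using 1 + 2 + ... + n = n(n+1)/2, and the last printed column is (b² - a²)/2.

```python
def right_riemann_sum_of_x(a, b, n):
    """R_n for f(x) = x on [a, b], with n equal subintervals and right endpoints."""
    dx = (b - a) / n
    return sum((a + i * dx) * dx for i in range(1, n + 1))

def closed_form(a, b, n):
    """The simplified expression a(b - a) + ((b - a)^2 / 2)(1 + 1/n)."""
    return a * (b - a) + (b - a) ** 2 / 2 * (1 + 1 / n)

a, b = 1.0, 4.0
for n in (10, 100, 1000):
    # Columns: n, the sum computed term by term, the closed form, the limiting value.
    print(n, right_riemann_sum_of_x(a, b, n), closed_form(a, b, n), (b**2 - a**2) / 2)
```

The second and third columns should agree up to rounding, and both should approach the fourth as n grows.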


Example 2.3.23. Using the area interpretation of the integral, it is easy to derive Proposition 2.3.22
geometrically.

First, suppose that b > 0. Then, the integral $\int_0^b x\,dx$ equals the area under the graph of
y = x and above the interval [0, b] on the x-axis. This is the area of a triangle of width b and
height b.

Figure 2.24: Area under y = x.

Thus,

$$\int_0^b x\,dx = \frac{b^2}{2}.$$

This formula holds even if b < 0 for, in that case,

$$\int_0^b x\,dx = -\int_b^0 x\,dx,$$

and $\int_b^0 x\,dx$ is equal to negative the area below the interval [b, 0] and above the graph of y = x;
this area is again $b^2/2$.




Figure 2.25: Area above y = x.

Hence,

$$\int_0^b x\,dx = -\int_b^0 x\,dx = -\left(-\frac{b^2}{2}\right) = \frac{b^2}{2}.$$

Of course, it follows that

$$\int_0^a x\,dx = \frac{a^2}{2},$$

and so, using Theorem 2.3.18, we find

$$\int_a^b x\,dx = \int_a^0 x\,dx + \int_0^b x\,dx = -\int_0^a x\,dx + \int_0^b x\,dx = -\frac{a^2}{2} + \frac{b^2}{2},$$

which agrees with what we obtained in Proposition 2.3.22.


Example 2.3.24. Now that we have Proposition 2.3.22 and Proposition 2.3.12, we can combine
them with linearity to calculate

$$\int_a^b (mx + c)\,dx,$$

where m and c are constants.

We find

$$\int_a^b (mx + c)\,dx = m\int_a^b x\,dx + \int_a^b c\,dx = m\left(\frac{b^2-a^2}{2}\right) + c(b-a).$$

Example 2.3.25. Consider the integral $\int_{-3}^{6} |x|\,dx$.

Figure 2.26: Area under y = |x|.


By Theorem 2.3.16, we have

$$\int_{-3}^{6} |x|\,dx = \int_{-3}^{0} |x|\,dx + \int_{0}^{6} |x|\,dx.$$

Why split up the integral like this? Because |x| = x, if $x \ge 0$, and |x| = -x, if $x \le 0$. Thus,
|x| = -x on the interval [-3, 0], and |x| = x on the interval [0, 6]. Therefore, we obtain

$$\int_{-3}^{6} |x|\,dx = \int_{-3}^{0} -x\,dx + \int_{0}^{6} x\,dx = -\int_{-3}^{0} x\,dx + \int_{0}^{6} x\,dx = -\left(\frac{0^2-(-3)^2}{2}\right) + \frac{6^2-0^2}{2} = \frac{9}{2} + \frac{36}{2} = \frac{45}{2}.$$

Note that this agrees with what you would get from interpreting the integral in terms of
area; see Figure 2.26.
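The value 45/2 can be double-checked with a Riemann sum. This is a minimal sketch: the helper name `midpoint_riemann_sum` and the choice of 9000 subintervals are ours. With that choice, x = 0 is a partition point, so |x| is linear on every subinterval.

```python
def midpoint_riemann_sum(f, a, b, n):
    """Riemann sum of f on the uniform partition of [a, b], sampled at midpoints."""
    dx = (b - a) / n
    return sum(f(a + (i - 0.5) * dx) * dx for i in range(1, n + 1))

approx = midpoint_riemann_sum(abs, -3.0, 6.0, 9000)
exact = 9 / 2 + 36 / 2  # the two triangle areas from the example
print(approx, exact)
```

Geometrically, the two terms are the areas of triangles with legs of length 3 and of length 6.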
Example 2.3.26. The cosine function is continuous and, for all x, $-1 \le \cos x \le 1$. If we want
bounds on the integral of cos x, we can apply Theorem 2.3.20 twice.

For instance, even though we do not yet know how to calculate $\int_0^{\pi/2} \cos x\,dx$, we can conclude
that

$$-\frac{\pi}{2} = \int_0^{\pi/2} -1\,dx \le \int_0^{\pi/2} \cos x\,dx \le \int_0^{\pi/2} 1\,dx = \frac{\pi}{2}.$$

In fact, as we shall see later, $\int_0^{\pi/2} \cos x\,dx = 1$, which is, indeed, between $-\pi/2$ and $\pi/2$.
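These bounds can be observed numerically. The sketch below approximates the three integrals over [0, π/2] with midpoint Riemann sums. The helper name and the choice of 1000 subintervals are our own assumptions.

```python
import math

def midpoint_riemann_sum(f, a, b, n):
    """Riemann sum of f on the uniform partition of [a, b], sampled at midpoints."""
    dx = (b - a) / n
    return sum(f(a + (i - 0.5) * dx) * dx for i in range(1, n + 1))

a, b = 0.0, math.pi / 2
lower = midpoint_riemann_sum(lambda x: -1.0, a, b, 1000)
middle = midpoint_riemann_sum(math.cos, a, b, 1000)
upper = midpoint_riemann_sum(lambda x: 1.0, a, b, 1000)
print(lower, middle, upper)
```

The inequality between the integrands holds at every sample point, so it holds for the Riemann sums themselves. That is the observation behind the monotonicity theorem.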


Proposition 2.3.27.

$$\int_a^b x^2\,dx = \frac{b^3-a^3}{3}.$$

Proof. We will prove that, if b > 0, then $\int_0^b x^2\,dx = b^3/3$. The case where b < 0 is similar, and
we leave it as an exercise. Once you know that $\int_0^b x^2\,dx = b^3/3$, the proposition follows from
applying Theorem 2.3.18.

So, assume that b > 0. How do we show that $\int_0^b x^2\,dx = b^3/3$? As $x^2$ is continuous, we know
that the integral exists, and so we may calculate it by taking a limit of Riemann sums, in which
the meshes of our partitions approach 0. This was how we derived Proposition 2.3.22.


Let $P_n$ be the partition of [0, b] into n subintervals of equal length, i.e., the partition in which
$x_i = ib/n$, and $\Delta x_i = b/n$. If we once again use right Riemann sums, our sample set $S_n$ is the
one in which $s_i$ also equals $ib/n$.

Our corresponding Riemann sum for $x^2$ is

$$R_n = \sum_{i=1}^n \left(\frac{ib}{n}\right)^2 \cdot \frac{b}{n} = \frac{b^3}{n^3}\sum_{i=1}^n i^2.$$




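By Item (b) of Corollary 2.1.11, we know that

$$\sum_{i=1}^n i^2 = \frac{n(n+1)(2n+1)}{6},$$

and so

$$R_n = \frac{b^3}{n^3}\cdot\frac{n(n+1)(2n+1)}{6} = \frac{b^3}{6}\left(1+\frac{1}{n}\right)\left(2+\frac{1}{n}\right).$$

Therefore, as $n \to \infty$, $R_n \to (b^3/6)(2) = b^3/3$ and, hence, $\int_0^b x^2\,dx = b^3/3$, as we wanted to
show.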
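To double-check the algebra in this proof, the following sketch computes the right Riemann sums for x² on [0, b] in exact rational arithmetic and compares them with the factored expression (b³/6)(1 + 1/n)(2 + 1/n). The function names and the sample value b = 2 are our own choices.

```python
from fractions import Fraction

def right_riemann_sum_of_squares(b, n):
    """R_n for f(x) = x^2 on [0, b]: n equal subintervals, right endpoints, exact arithmetic."""
    dx = Fraction(b) / n
    return sum((i * dx) ** 2 * dx for i in range(1, n + 1))

def factored_form(b, n):
    """(b^3 / 6) * (1 + 1/n) * (2 + 1/n), also in exact arithmetic."""
    b = Fraction(b)
    return b ** 3 / 6 * (1 + Fraction(1, n)) * (2 + Fraction(1, n))

b = 2
for n in (1, 10, 100, 1000):
    r = right_riemann_sum_of_squares(b, n)
    assert r == factored_form(b, n)  # the two expressions agree exactly
    print(n, float(r), float(Fraction(b) ** 3 / 3))
```

Exact fractions are used so that the comparison tests the identity itself rather than floating-point rounding.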


Example 2.3.28. The area interpretation of the definite integral lets us calculate the definite
integral of f(x) in cases where the region between the x-axis and the graph of y = f(x) consists
of rectangles, triangles, and trapezoids.

Of course, we also know that the area inside a circle of radius r is $\pi r^2$, which enables us to
calculate integrals like:

$$\int_0^r \sqrt{r^2 - x^2}\,dx,$$

where r > 0 is a constant. This integral represents the area inside the first quadrant quarter of
a circle of radius r, centered at the origin; see Figure 2.27.

Figure 2.27: Area under $y = \sqrt{r^2 - x^2}$.

Thus,

$$\int_0^r \sqrt{r^2 - x^2}\,dx = \frac{\pi r^2}{4}.$$




Example 2.3.29. Let’s return to the rod of varying density in Example 2.2.8, and discuss the
problem of calculating its total mass, but now we shall use our “continuous sum of infinitesimal
contributions” language.

A circular rod, of length 1 meter, and cross-sectional area 0.01 m² (i.e., of radius $0.1/\sqrt{\pi}$
meters) is lying along the x-axis.

Figure 2.28: A rod of varying density.

For all x such that $0 \le x \le 1$, at each point in the cross section of the rod at x meters, the
density of the rod is $\delta(x) = (1 + x)$ kg/m³. What is the total mass of the rod?

Solution:
Solution: 


We shall present the solution using the language of infinitesimals. Consider the infinitesimal
amount of volume dV lying along the infinitesimal subinterval at x of infinitesimal length dx.

Figure 2.29: An infinitesimal interval, volume, and mass.

The infinitesimal volume dV is just equal to the cross-sectional area times the length dx, i.e.,

$$dV = 0.01\,dx \ \text{m}^3,$$

and the infinitesimal mass dM of this infinitesimal volume is simply the density at x times the
(infinitesimal) volume dV, i.e.,

$$dM = \delta(x)\,dV = (1+x)\,0.01\,dx \ \text{kg}.$$

Finally, the total mass M of the rod is the continuous sum of all of the infinitesimal masses as
x goes from 0 to 1 meter, i.e.,

$$M = \int_{x=0}^{x=1} dM = \int_0^1 (1+x)\,0.01\,dx \ \text{kg}.$$
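The “continuous sum” can be imitated by a finite sum over many thin slices of the rod. The sketch below is only a numerical illustration. The names `density`, `CROSS_SECTIONAL_AREA`, and `approximate_mass` are hypothetical, and the comparison value comes from applying linearity together with the formulas for the integrals of x and of a constant.

```python
def density(x):
    """Density in kg/m^3 at position x meters along the rod, as given in the example."""
    return 1.0 + x

CROSS_SECTIONAL_AREA = 0.01  # m^2

def approximate_mass(n):
    """Add up density * (cross-sectional area * dx) over n thin slices, sampled at midpoints."""
    dx = 1.0 / n
    return sum(density((i - 0.5) * dx) * CROSS_SECTIONAL_AREA * dx
               for i in range(1, n + 1))

# Linearity gives 0.01 * ((1^2 - 0^2) / 2) + 0.01 * (1 - 0) = 0.015 kg.
predicted = 0.01 * (1**2 - 0**2) / 2 + 0.01 * (1 - 0)
print(approximate_mass(1000), predicted)
```

Each term of the finite sum plays the role of one dM, with dx replaced by a small but finite slice width.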


Suppose that a < b. We now want to develop a reasonable notion of “the mean, or average, 
value of f on the interval [a, b].”


What should the average value of f mean? Well...if there were a finite number of f-values, 
we would simply add up all of the values and then divide by the number of values; this is what
is normally meant by the “average” or “mean”. The problem, of course, is that we have an 


infinite number of values f(x), one for each x in the interval [a,b]. How do we deal with this? 


As you may have guessed, we take the average of a finite number of f-values, and then take
a limit. Let $n \ge 1$ be an integer. Subdivide the interval [a, b] into n subintervals of equal length
$\Delta x = (b-a)/n$. As usual, in the i-th subinterval, we pick a sample point $s_i$. We may now take
the average value of f at the n sample points; we obtain

$$\frac{f(s_1) + f(s_2) + \cdots + f(s_n)}{n} = \frac{\sum_{i=1}^n f(s_i)}{n}.$$

So, what would be a reasonable notion of the average value of f on the entire interval? We
can try taking the limit of the average above, as n approaches ∞, but why would this limit
exist, and what does it have to do with integration?


Things become more clear if we multiply the numerator and denominator of the above
fractions by Δx. Then, we obtain

$$\frac{\sum_{i=1}^n f(s_i)\Delta x}{n\Delta x} = \frac{\sum_{i=1}^n f(s_i)\Delta x}{b-a}.$$

If f is Riemann integrable on [a, b], then, as $n \to \infty$, this last numerator approaches $\int_a^b f(x)\,dx$,
and so, the average value of f at the sample points approaches

$$\frac{1}{b-a}\int_a^b f(x)\,dx.$$

Therefore, we make the following definition: 


Definition 2.3.30. Suppose that a < b and that f is Riemann integrable on [a, b]. Then,
we define the mean value, or average value, of f on [a, b] to be

$$\frac{1}{b-a}\int_a^b f(x)\,dx.$$

For continuous functions, we have the following theorem about the mean value of f.


Theorem 2.3.31. (Mean Value Theorem for Integration) Suppose that f is continu- 
ous on the closed interval [a,b]. Then, there exists c in [a,b] such that 


$$\int_a^b f(x)\,dx = (b-a)f(c).$$

Thus, if a < b, there exists c in the interval [a, b] such that f(c) equals the mean value
of f on [a, b], i.e., f attains its mean value on [a, b] at some point in [a, b].


Proof. If a = b, then the theorem is obviously true. So, assume that a < b. 


As f is continuous on [a,b], the Extreme Value Theorem (see [2]) tells us that f attains a
global minimum value m on [a,b] and a global maximum value M on [a,b].


Thus, for all x in [a,b], m ≤ f(x) ≤ M. By Theorem 2.3.20, this implies that

\[ m(b-a) = \int_a^b m\,dx \leq \int_a^b f(x)\,dx \leq \int_a^b M\,dx = M(b-a). \]

Since a < b, b − a > 0, and so we may divide to obtain

\[ m \leq \frac{1}{b-a}\int_a^b f(x)\,dx \leq M. \]


Now, the function f is continuous on [a,b], attains the values m and M, and $\frac{1}{b-a}\int_a^b f(x)\,dx$
is a value between m and M. The Intermediate Value Theorem (see [2]) implies that there
exists c in [a, b] such that

\[ f(c) = \frac{1}{b-a}\int_a^b f(x)\,dx. \]


Multiply each side of the above equality by b — a to obtain the theorem. 


Remark 2.3.32. The area interpretation of integration helps us visualize the mean value of a
function on an interval. If f ≥ 0 and f is continuous on the interval [a,b], then the
mean value of f on [a,b] is the height of a rectangle with base [a,b] such that the rectangle has
the same area as that below the graph of f and above the interval [a,b].

For instance, as we shall see in Example 2.4.13, the mean value of y = f(x) = 1/(1 + x²) on
the interval [−1, 1] is π/4.


Figure 2.30: Equal areas under y = 1/(1 + x²) and y = π/4.


The Mean Value Theorem for Integration guarantees that, since f(x) = 1/(1 + x²) is continuous,
there exists at least one x value in [−1,1] at which f attains its mean value. In fact,
we see in Figure 2.30 that there are two such x values, approximately ±0.5. We calculate these
values precisely in Example 2.4.13.


2.3.1 Exercises


Calculate the definite integrals. 


7 
1 f Sar. 
3 
6 
| 5t — Tdt. 
= 
3. $\int_4^8 |u - 4| + 2\,du$.


4. $\int_{-3}^{7} 2|4 - 8z|\,dz$.


5. $\int_0^{12} 5w\,dw$.


6. [w+sa 


=35 


3 
Cs | xv +adz. g 
2 


8. $\int^{6} f(x)\,dx$, where $f(x) = \begin{cases} 3, & x < -1; \\ -5, & x \geq -1. \end{cases}$


9. $\int_4^7 e^2\,dz$.


Answer the following true/false questions. If the statement is true, why is it true?
If the statement is false, provide a counterexample.


10. If f is differentiable on an open interval containing [a, b], then $\int_a^b f(x)\,dx$ exists.

11. If $\int_a^b f(x)\,dx < \int_a^b g(x)\,dx$, then f(x) < g(x) for all x ∈ [a, b].

12. If f is integrable, then f is continuous.

13. $\int_a^b f(x)\,dx = \int_b^a f(x)\,dx$ for all integrable functions f.


In each of Exercises 14 through 18, calculate the average value of the function over
the interval.


14. c(x) = —10, [-11, 9]. 


15. g(x) = 12x − 10, [4, 9].


16. h(x) = . (0, 12]. g 


17. m(t) = |t| +3, [-3, 6]. 


18. p(x) = a ae [—2, 4]. 


19. Recall that the height of a projectile with initial velocity vo and ho is 


1 
A(t) = soe + uot + ho. 


Calculate the average height between times t = 0 and t = ¢). ic) 


20. Prove that the function

\[ f(x) = \begin{cases} 0, & \text{if } x \text{ is rational;} \\ 1, & \text{if } x \text{ is irrational} \end{cases} \]

is not integrable.


21. Construct an example of a function that is integrable but that does not satisfy the mean 
value property. That is, construct a function that does not achieve its average value over 


its domain. 


In each of Exercises 22 through 25, you are given the velocity function of a particle,
in meters per second. Calculate the average velocity of the particle over the
specified interval.


22. v(t) = 3t + 2, [4, 8].
23. v(t) = t² − 5t + 3, [2, 6].
24. v(t) = t³ − 2t² + 3t + 2, [−1, 3].
25. v(t) = |t − 4|, [2, 6].

In Exercises 26 through 29, you are given the same velocity function and interval as in the
previous four problems. For each problem, calculate (a) the acceleration function,
and (b) the average acceleration over the interval.


26. v(t) = 3t + 2, [4, 8].
27. v(t) = t² − 5t + 3, [2, 6].
28. v(t) = t³ − 2t² + 3t + 2, [−1, 3].
29. v(t) = |t − 4|, [2, 6].


30. Suppose p_1(t) and p_2(t) are the position functions of two particles and that p_2(t) = p_1(t) + C,
where C is some constant. The position functions have common domain [a, b].


a. What is the relationship between the average positions of the two particles? 
b. What is the relationship between the average velocities of the two particles? 


c. What is the relationship between the average accelerations of the two particles? 


31. Suppose that v_1(t) and v_2(t) are the velocity functions of two particles and that v_2(t) =
v_1(t) + C, where C is some constant. The velocity functions have common domain [a, b].


a. What is the relationship between the average velocities of the two particles?


b. What is the relationship between the average accelerations of the two particles? 


32. Is the function below integrable on [—7, 7]? 


_ Jsin(1/z) 2 40 
ney={" r=0. 


33. Is the function below integrable on [—7, 7]? 


_ jaxsin(l/z) «#0 
ae)= {5 2=0. 


Estimate the definite integrals using the given partitions using the left endpoint 


sample set. 
34. $\int_0^1 \sin(\pi x)\,dx$, P = {0, 1/4, 1/2, 3/4, 1}.


35. $\int_0^1 \sqrt{x}\,dx$, P = {0, 1/4, 1/3, 3/4, 1}.


1
1
36. d. = {−1, −1/2, 0, 1/2, 1}.
[ah P= Ch -12.01/2)

37. $\int_{-1}^{1} \cosh x\,dx$, P = {−1, −1/2, 0, 1/2, 1}.


Calculate the definite integrals below by determining the area enclosed between
their graphs and the x-axis. Remember that area below the x-axis is counted with
a minus sign in the integral.


7 
38. i 2x2 —3dz. 


—4 


39. $\int_{-5}^{5} \sqrt{25 - x^2} + 4\,dx$.


40. $\int_2^6 |x - 3| + 5\,dx$.


The idea of using integrals to calculate the area between a curve and the x-axis can
be generalized. To calculate the area of the region between the graphs of f(x) and
g(x), we just need to notice that if f ≥ g ≥ 0, then the area between the graphs
of f and g is the area bounded by the graph of f and the x-axis minus the area
bounded by the graph of g and the x-axis. By the linearity of the integral, this area
is given by $\int_a^b (f(x) - g(x))\,dx$. In each of Exercises 41 through 44, calculate the area
bounded by the two graphs along the interval.


41. f(x) = 4x, g(x) = 2, [8, 12].
42. f(x) = 5x, g(x) = 3x, [0, 10].


AS. f(o) = 2, g(2) = 07, [0,1]. ce 


44. f(x) = 25 − x², g(x) = 3, [−4, 4].


45. Consider a triangle contained entirely in the first quadrant with vertices P = (x_1, y_1),
Q = (x_2, y_2) and R = (x_2, y_3). Assume further that x_1 < x_2 and that y_3 > y_2.


a. Show that the equations of the lines between P and R, and P and Q are

\[ f(x) = \frac{y_3 - y_1}{x_2 - x_1}\,x - \frac{x_2(y_3 - y_1)}{x_2 - x_1} + y_3 \]

and

\[ g(x) = \frac{y_2 - y_1}{x_2 - x_1}\,x - \frac{x_2(y_2 - y_1)}{x_2 - x_1} + y_2, \]

respectively.

b. Calculate $\int_{x_1}^{x_2} f(x) - g(x)\,dx$ and conclude that the area of the triangle is

\[ \frac{1}{2}\left[x_2 y_3 - x_2 y_2 + x_1 y_2 - x_1 y_3\right] = \frac{1}{2}(x_1 - x_2)(y_2 - y_3). \]


46. Suppose a 10 meter long rod lies along the x-axis. The rod is a rectangular prism with
cross-sectional area 2 m². The density of the rod is given by δ(x) = (x² + 1) kg/m³.
Express the total mass of the rod in terms of a definite integral. You need not calculate
the integral.


47. Suppose a 10 meter metallic cone lies along the x-axis with its vertex at the origin. More
specifically, the projection of the cone onto the xy-plane is defined by the line segments
y = x and y = −x for x ∈ [0, 10]. The density at each point in the cross section of the cone
is given by δ(x) = (12 − x) kg/m³. Express the total mass of the cone in terms of a definite
integral. You need not calculate the integral. Hint: in this case, dA, the infinitesimal area,
is no longer constant.


Recall that a parameterized planar curve with domain [a,b] is a map α(t) into the
xy-plane. That is, α(t) = (x(t), y(t)). Definite integrals can be used to calculate the
length of curves. Specifically, if α(t) is a differentiable curve on [a,b], the length of
the curve is $\int_a^b |\alpha'(t)|\,dt$. Calculate the lengths of the curves in Exercises 48 - 51.


48.


49.


50.


51.


α(t) = (R cos t, R sin t), [0, π], where R > 0.


52. Prove the constant multiple portion of Theorem 2.3.19, i.e., prove that if f is Riemann
integrable on the closed interval [a,b] and c is a constant, then cf is Riemann integrable
on [a,b], and

\[ \int_a^b c f(t)\,dt = c \int_a^b f(t)\,dt. \]


2.4 The Fundamental Theorem of Calculus 


The definite integral has many applications, all of which stem from the fact that the definition of 


the definite integral as a limit of Riemann sums makes the definite integral a “continuous sum of 
infinitesimal contributions”. However, actually calculating definite integrals by taking limits of 
Riemann sums is extraordinarily painful, and, for definite integrals to really be manageable, we 
need an easy way to calculate these limits of Riemann sums that come up in so many physical 


situations. 


The second part of the Fundamental Theorem of Calculus, Theorem 2.4.10, provides this
“easy” method; it tells us that we may calculate definite integrals by anti-differentiating. This
theorem is truly “fundamental”, for it tells us that the two basic ideas of Calculus, differentiating
and integrating, are inextricably related.


In Proposition 2.3.12, Proposition 2.3.22, and Proposition 2.3.27, we gave formulas for the
definite integrals of f(x) = k, g(x) = x, and h(x) = x² over the interval [a,b]. Those formulas
may not have had an obvious pattern, but it may become clear what’s going on after we introduce
some new notation.


Definition 2.4.1. If F(x) is a real function, defined at x = a and x = b, then we define
the evaluation notation, $F(x)\big|_a^b$, by

\[ F(x)\Big|_a^b = F(b) - F(a). \]

We also write

\[ F(x)\Big|_{x=a}^{x=b} = F(b) - F(a), \]

if we want to emphasize the name of the variable that takes on the values a and b.


We read this as: F evaluated from a to b, or F(x) evaluated from x = a to x = b.


Now that we have this new notation, let’s rewrite our formulas from Proposition 2.3.12,
Proposition 2.3.22, and Proposition 2.3.27. We have

\[ \int_a^b k\,dx = k(b-a) = kx\,\Big|_a^b, \]

\[ \int_a^b x\,dx = \frac{b^2 - a^2}{2} = \frac{x^2}{2}\,\Big|_a^b, \]

and

\[ \int_a^b x^2\,dx = \frac{b^3 - a^3}{3} = \frac{x^3}{3}\,\Big|_a^b. \]


Do you see the pattern here? It may not be clear from just three examples, but you may be 


able to guess what happens more generally. 


In each of the three formulas above, the derivative of the function being evaluated from a to 
b on the right is precisely the function being integrated from a to b on the left. In other words, 
the function being evaluated from a to b on the right is an anti-derivative of the integrand. 
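The pattern can be checked numerically by comparing Riemann sums against the evaluation of an anti-derivative. The following is a minimal sketch under stated assumptions: the interval [1, 4], the constant k = 5, right endpoints as sample points, and the helper names `riemann_sum` and `evaluate` are all chosen here for illustration and do not come from the text.

```python
def riemann_sum(f, a, b, n=100_000):
    """Right-endpoint Riemann sum of f over [a, b] with n equal subintervals."""
    dx = (b - a) / n
    return sum(f(a + i * dx) for i in range(1, n + 1)) * dx


def evaluate(F, a, b):
    """The evaluation notation: F evaluated from a to b is F(b) - F(a)."""
    return F(b) - F(a)


a, b, k = 1.0, 4.0, 5.0  # illustrative values (assumptions)

# Each pair is (integrand, an anti-derivative of that integrand).
pairs = [
    (lambda x: k, lambda x: k * x),
    (lambda x: x, lambda x: x**2 / 2),
    (lambda x: x**2, lambda x: x**3 / 3),
]

results = [(riemann_sum(f, a, b), evaluate(F, a, b)) for f, F in pairs]
for approx, exact in results:
    print(approx, exact)
```

In each row, the Riemann sum lands close to the anti-derivative evaluated from a to b, which is the pattern that the Fundamental Theorem makes precise.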


Is this true more generally? YES. This result is part of the Fundamental Theorem of
Calculus, and it is the reason that the notation $\int f(x)\,dx$ and terminology “indefinite integral”
for anti-derivatives are so similar to the notation and terminology for definite integrals.


Once we have the Fundamental Theorem, we will no longer need to compute definite integrals
by the cumbersome process of taking limits of Riemann sums. All of our anti-derivative formulas
from Chapter 1 will become formulas for computing definite integrals.


Understand: ALL of the applications of the definite integral are due to the fact that it 
represents a continuous sum of infinitesimal contributions, i.e., that the definite integral is 


a limit of Riemann sums. The Fundamental Theorem of Calculus is NOT the definition of 
the definite integral; the value of the Fundamental Theorem is that it allows us to calculate 
these continuous sums much more easily. 


We need a few results before we can prove the Fundamental Theorem of Calculus. 


Suppose that f is a Riemann integrable function on the interval [a,b]. By Theorem 2.3.16
(and Definition 2.3.17), for all x in [a,b], the function f is also integrable on the interval [a, x].
Thus, we may make the following definition.


Definition 2.4.2. Suppose that f is a Riemann integrable function on the interval [a,b].
Then, the integral function of f on [a,b] is the function $I_f^{[a,b]}$ with domain [a,b] and
codomain (−∞, ∞), given by

\[ I_f^{[a,b]}(x) = \int_a^x f(t)\,dt. \]


Note the use of the dummy variable t in the integrand; it would be confusing to use x in
the integrand, as x is the upper limit of integration. We did not have to use t as our dummy
variable (though it is a standard choice); we could have used any variable name, other than x,
a, b, and f. The dummy variable d would also be a bit confusing.


It will be important to us that the following theorem is true, and its proof is instructive, 


since it uses a number of basic properties of integration. 


Theorem 2.4.3. Suppose that f is Riemann integrable on the interval [a,b]. Then, the
integral function, $I_f^{[a,b]}$, of f on [a,b] is continuous.


Proof. Let c be in [a,b]. We need to show that the limit, as x approaches c, of $I_f^{[a,b]}(x)$ is equal
to $I_f^{[a,b]}(c)$, where, if c equals a or b, we mean that we take the corresponding one-sided limit, so
that we are always looking at x’s in the interval [a,b]. For simplicity, we will use the two-sided
limit notation throughout the proof.


We will show that

\[ \lim_{x \to c} I_f^{[a,b]}(x) = I_f^{[a,b]}(c) \]

by showing that

\[ \lim_{x \to c} \left| I_f^{[a,b]}(x) - I_f^{[a,b]}(c) \right| = 0. \]

Note that $\left| I_f^{[a,b]}(x) - I_f^{[a,b]}(c) \right| = \left| I_f^{[a,b]}(c) - I_f^{[a,b]}(x) \right|$.


Now,

\[ I_f^{[a,b]}(x) - I_f^{[a,b]}(c) = \int_a^x f(t)\,dt - \int_a^c f(t)\,dt = \int_c^x f(t)\,dt, \]

and, hence, we wish to show that

\[ \lim_{x \to c} \left| \int_c^x f(t)\,dt \right| = 0. \]

We accomplish this by using that, since f is Riemann integrable on [a,b], f is bounded on
[a,b], by Theorem 2.3.6. Thus, there exists M > 0 such that, for all x in [a,b], −M ≤ f(x) ≤ M.
Now, if x < c, the monotonicity of integration, Theorem 2.3.20, implies that

\[ -M(c-x) = \int_x^c -M\,dt \leq \int_x^c f(t)\,dt \leq \int_x^c M\,dt = M(c-x). \]

Similarly, if c < x, then

\[ -M(x-c) = \int_c^x -M\,dt \leq \int_c^x f(t)\,dt \leq \int_c^x M\,dt = M(x-c). \]

Hence, in either case, we obtain

\[ \left| \int_c^x f(t)\,dt \right| \leq M|c - x|. \]

As x approaches c, |c − x| approaches 0, and, thus, so does $\left| \int_c^x f(t)\,dt \right|$.
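The key estimate in this proof holds even where f itself jumps. As a sketch, consider a step function chosen purely for illustration (it is an assumption of this example, not a function from the text): f(t) = 1 for t < 1 and f(t) = 3 for t ≥ 1 on [0, 2]. Its integral function can be written down exactly from areas of rectangles, and the bound M = 3 works.

```python
def f(t):
    """A bounded step function on [0, 2] with a jump at t = 1 (illustrative)."""
    return 1.0 if t < 1.0 else 3.0


def integral_function(x):
    """Integral of f from 0 to x, computed exactly from rectangle areas."""
    return x if x <= 1.0 else 1.0 + 3.0 * (x - 1.0)


M = 3.0  # |f(t)| <= M for all t in [0, 2]
c = 1.0  # the point where f is discontinuous

rows = []
for h in (0.1, 0.01, 0.001):
    for x in (c - h, c + h):
        gap = abs(integral_function(x) - integral_function(c))
        rows.append((x, gap, M * abs(x - c)))

for x, gap, bound in rows:
    print(x, gap, bound)
```

The gap between the integral function at x and at c never exceeds M|x − c|, so it shrinks to 0 as x approaches c, even though f is not continuous at c.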


Remark 2.4.4. The conclusion of Theorem 2.4.3 is that the function on [a,b] that sends x to
$\int_a^x f(t)\,dt$ is continuous.


An essentially identical argument shows that the function on [a,b] that sends x to $\int_x^b f(t)\,dt$
is continuous.


Earlier, we defined what is meant by an anti-derivative of a function f on an
open interval J; it’s a function F on J such that F′ = f (at all points in J). We also reminded
you that the Mean Value Theorem for Derivatives implies that any two anti-derivatives of the
same f must differ by a constant. That is, if F_1 and F_2 are both anti-derivatives of f on J, then
there exists a constant C such that, for all x in J, F_1(x) = F_2(x) + C.


It will be useful for us to define the notion of an anti-derivative of a function on a closed 
interval, or on any interval whatsoever, and to prove that any two such anti-derivatives differ 


by a constant. 


Definition 2.4.5. Let f be a function on the interval J. An anti-derivative of f on J
is a continuous function on J, which, on the interior of J (the open interval of points other
than possible endpoints of the original interval), is an anti-derivative of f, i.e., a continuous
function F on J such that F is differentiable on the interior of J, and such that, for all x in
the interior of J, F′(x) = f(x).


Theorem 2.4.6. Suppose that f is a function on the interval J. Then, any two
anti-derivatives of f on J differ by a constant.


Proof. This follows easily from the result on open intervals. If J consists of a single point, then
the result is trivially true. So, assume that is not the case, i.e., assume that the interior of J is
a non-empty open interval.


Suppose that both F_1 and F_2 are anti-derivatives of f on J. Then, we know that there exists
a constant C such that, for all x in the interior of J, F_1(x) = F_2(x) + C.


Suppose that a is a left endpoint of the interval J, and that a is contained in J. As F_1 and
F_2 are continuous at a, we have

\[ F_1(a) = \lim_{x \to a^+} F_1(x) = \lim_{x \to a^+} \left(F_2(x) + C\right) = F_2(a) + C. \]

In the exact same fashion, we see that, if b is a right endpoint of J, which is contained in J,
then F_1(b) = F_2(b) + C.


We are now going to prove the Fundamental Theorem of Calculus. The theorem is usually
broken into two parts. The first part tells us that the integral function $I_f^{[a,b]}$ of a continuous
function f on the interval [a,b] is an anti-derivative of f on [a,b]. The second part tells us that,
if we already know an anti-derivative of f on [a,b], then we can use that to evaluate $\int_a^b f(x)\,dx$.
It is this second part that is used most often in applications.


Theorem 2.4.7. (Fundamental Theorem of Calculus, Part 1) Suppose that f is
Riemann integrable on [a,b] and is continuous at a point x_0 in (a,b). Then, the integral
function $I_f^{[a,b]}$ of f on [a,b] is differentiable at x_0 and

\[ \left(I_f^{[a,b]}\right)'(x_0) = f(x_0). \]

Thus, if f is continuous on [a,b], then $I_f^{[a,b]}$ is an anti-derivative of f on [a,b].


Proof. See Theorem 2.A.10.

If f is continuous on an interval J, then, for all a and x in J, f is continuous on the interval
[a,x] or [x,a] (depending on which interval is defined), and so, in either case, $\int_a^x f(t)\,dt$ exists.
It is easy to conclude the following corollary from Theorem 2.4.7.


Corollary 2.4.8. Suppose that f is continuous on an interval J, and that a is a point in
J. Then, the function of x, with domain J, given by

\[ \int_a^x f(t)\,dt \]

is an anti-derivative of f on J.


Remark 2.4.9. The first part of the Fundamental Theorem, and its corollary, are of great
theoretical importance; they tell us that all continuous functions have anti-derivatives.


For instance, quick... what’s an anti-derivative of the continuous function $f(x) = e^{-x^2}$?


That’s easy: $F(x) = \int_0^x e^{-t^2}\,dt$ is an anti-derivative of $e^{-x^2}$.


Understand: this means that, if you want to calculate

\[ \frac{d}{dx}\left( \int_0^x e^{-t^2}\,dt \right), \]

you don’t first calculate $\int_0^x e^{-t^2}\,dt$, and then take the derivative. You simply apply the
Fundamental Theorem to conclude immediately that

\[ \frac{d}{dx}\left( \int_0^x e^{-t^2}\,dt \right) = e^{-x^2}. \]
dx 0 


What’s another anti-derivative of $e^{-x^2}$? Easy again: $\int_{37}^x e^{-t^2}\,dt$.


How can both $\int_0^x e^{-t^2}\,dt$ and $\int_{37}^x e^{-t^2}\,dt$ be anti-derivatives of $e^{-x^2}$? Because they differ by
a constant:

\[ \int_0^x e^{-t^2}\,dt - \int_{37}^x e^{-t^2}\,dt = \int_0^{37} e^{-t^2}\,dt, \]

which is just a fixed constant.


We should make a final comment. Recall Remark 1.1.23, where we stated that the function
$f(x) = e^{-x^2}$ has no elementary anti-derivative. This is true. The Fundamental Theorem
guarantees that continuous functions possess anti-derivatives; it does not tell us that those
anti-derivatives have to be easily expressible in terms of familiar functions.


Anti-derivatives of elementary functions need not be elementary functions. 
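These claims can be tested numerically. The sketch below relies on a fact not stated in the text: Python's standard `math.erf` computes the error function, a non-elementary special function defined by this very integral, so that the integral of e^(−t²) from 0 to x equals (√π/2)·erf(x). The helper names `F` and `derivative`, the step size, and the sample points are assumptions made for illustration.

```python
import math


def F(x, base=0.0):
    """Integral of exp(-t^2) from `base` to x, written via the error function."""
    return math.sqrt(math.pi) / 2 * (math.erf(x) - math.erf(base))


def derivative(g, x, h=1e-5):
    """Central-difference approximation to g'(x)."""
    return (g(x + h) - g(x - h)) / (2 * h)


points = (-1.0, 0.0, 0.5, 2.0)

# The derivative of the integral function should match the integrand.
for x in points:
    print(x, derivative(F, x), math.exp(-x * x))

# Two anti-derivatives with different lower limits differ by one fixed constant.
differences = [F(x, 0.0) - F(x, 37.0) for x in points]
print(differences)
```

The numerical derivative agrees with e^(−x²) at each sample point, and the difference between the two anti-derivatives is the same number at every x.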


We now come to the second part of the Fundamental Theorem of Calculus; the part that we 
discussed at the beginning of the section, the part that is most often used in applications. In a 
way, the result that we will state, and prove, should seem kind of obvious. 


Suppose that F(x) is an anti-derivative of f(x) on [a,b]. Then, for all x in (a,b), f(x) = F′(x),
and the derivative F′(x) is the limit of the change in F divided by the change in x, i.e., the limit,
as Δx approaches 0, of ΔF/Δx. But what’s the definite integral, $\int_a^b f(x)\,dx = \int_a^b F'(x)\,dx$? You
take limits of Riemann sums; you take the limit of what you get when you multiply values of
F′ times small changes in x, and add these together. In other words, as x goes from a to b, we
add up a bunch of quantities of the form

\[ F'(x)\,\Delta x \approx \frac{\Delta F}{\Delta x}\cdot\Delta x = \Delta F, \]

and we take the limit of this as Δx approaches 0. Now we use that the sum Σ(ΔF) telescopes,
just as in an earlier proposition, to give us F(b) − F(a). Therefore, the Riemann sums
approximately equal F(b) − F(a) and, as Δx approaches 0, the approximation “should” become an
equality. Technically, we need that f(x) is continuous, but our informal reasoning should make
the following theorem easy to believe.
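The informal argument can be made concrete with a small computation. In this sketch, the choice F(x) = sin x with F′(x) = cos x on [0, 1], left endpoints as sample points, and the helper name `telescoping_demo` are illustrative assumptions, not taken from the text.

```python
import math


def telescoping_demo(F, dF, a, b, n):
    """Return (sum of the exact changes in F, Riemann sum of F'(x) * dx)."""
    dx = (b - a) / n
    xs = [a + i * dx for i in range(n + 1)]
    # The changes F(x_{i+1}) - F(x_i) telescope to F(b) - F(a).
    sum_delta_F = sum(F(xs[i + 1]) - F(xs[i]) for i in range(n))
    # Left-endpoint Riemann sum for the integral of F' over [a, b].
    riemann = sum(dF(xs[i]) * dx for i in range(n))
    return sum_delta_F, riemann


a, b = 0.0, 1.0
for n in (10, 100, 1000):
    print(n, telescoping_demo(math.sin, math.cos, a, b, n), math.sin(b) - math.sin(a))
```

The sum of the changes equals F(b) − F(a) for every n (up to rounding), while the Riemann sum of F′(x)Δx only approaches that value as n grows.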


Theorem 2.4.10. (Fundamental Theorem of Calculus, Part 2) Suppose that f is
continuous on an interval [a,b], and that F is an anti-derivative of f on [a,b].
Then,

\[ \int_a^b f(x)\,dx = F(b) - F(a). \]


Proof. This actually follows very quickly from our previous results.

By Theorem 2.4.7, $I_f^{[a,b]}$ is an anti-derivative of f on [a,b]. As F is also an anti-derivative of
f on [a,b], Theorem 2.4.6 implies that there exists a constant C such that, for all x in [a,b],

\[ \int_a^x f(t)\,dt = F(x) + C. \]

When x = a, the left side of the above equality is 0, and we obtain that 0 = F(a) + C, i.e.,
C = −F(a). Therefore, for all x in [a,b],

\[ \int_a^x f(t)\,dt = F(x) - F(a). \]

When x = b, we obtain

\[ \int_a^b f(t)\,dt = F(b) - F(a), \]

which, noting that we may now use x as our dummy variable of integration, is what we wanted
to show.


It is now easy for us to calculate definite integrals by using our anti-derivative formulas from
Chapter 1.


Example 2.4.11. Calculate $\int_1^4 \left(7x^3 + 3\sqrt{x} + \frac{5}{x}\right) dx$.


Solution: 


This calculation would be ridiculously complicated if we had to explicitly use limits of Rie- 
mann sums. But, we don’t have to use Riemann sums; we now have the Fundamental Theorem 


of Calculus. 


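From our formulas in Section 1.1, we quickly find that

\[ 7\cdot\frac{x^4}{4} + 3\cdot\frac{x^{3/2}}{3/2} + 5\ln x = \frac{7x^4}{4} + 2x^{3/2} + 5\ln x \]

is an anti-derivative of $7x^3 + 3\sqrt{x} + \frac{5}{x}$ on [1,4] (actually, on all of (0, ∞)).

Therefore, the Fundamental Theorem tells us that

\[ \int_1^4 \left(7x^3 + 3\sqrt{x} + \frac{5}{x}\right) dx = \left(\frac{7x^4}{4} + 2x^{3/2} + 5\ln x\right)\Big|_1^4 = \]

\[ \left(\frac{7\cdot 4^4}{4} + 2\cdot 4^{3/2} + 5\ln 4\right) - \left(\frac{7\cdot 1^4}{4} + 2\cdot 1^{3/2} + 5\ln 1\right) = \]

\[ 448 + 16 + 5\ln 4 - \frac{7}{4} - 2 - 0 = \frac{1841}{4} + 5\ln 4 \approx 467.18147. \]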


You may be thinking “Wait, wouldn’t I get a different answer if I used a different
anti-derivative of $7x^3 + 3\sqrt{x} + \frac{5}{x}$?” The answer had better be “no”, and it is.


Any anti-derivative of $7x^3 + 3\sqrt{x} + \frac{5}{x}$ differs from the one we used by some constant C. Thus,
the only other possibilities for us to use for the anti-derivative are all of the form

\[ \frac{7x^4}{4} + 2x^{3/2} + 5\ln x + C. \]

Does the C cause us to get a different answer? No, because when we evaluate from 1 to 4, the
C gets cancelled out:

\[ \left(\frac{7x^4}{4} + 2x^{3/2} + 5\ln x + C\right)\Big|_1^4 = \]

\[ \left(\frac{7\cdot 4^4}{4} + 2\cdot 4^{3/2} + 5\ln 4 + C\right) - \left(\frac{7\cdot 1^4}{4} + 2\cdot 1^{3/2} + 5\ln 1 + C\right) = \]

\[ \left(\frac{7\cdot 4^4}{4} + 2\cdot 4^{3/2} + 5\ln 4\right) - \left(\frac{7\cdot 1^4}{4} + 2\cdot 1^{3/2} + 5\ln 1\right), \]

which is what we had before.


The moral of the story is that, when evaluating definite integrals by use of the Fundamental 


Theorem, you just use SOME anti-derivative; you don’t need to ever put in a general +C. 
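As a sanity check on this example, the sketch below evaluates the anti-derivative at the endpoints, repeats the evaluation with an added constant, and compares both with a direct midpoint Riemann sum. It assumes the integrand is 7x³ + 3√x + 5/x, which is the reading consistent with the anti-derivative and the value 467.18147 in the example; the helper names and the constant C = 10 are illustrative.

```python
import math


def F(x, C=0.0):
    """An anti-derivative of the integrand, with an optional added constant C."""
    return 7 * x**4 / 4 + 2 * x**1.5 + 5 * math.log(x) + C


def integrand(x):
    return 7 * x**3 + 3 * math.sqrt(x) + 5 / x


def midpoint_sum(f, a, b, n=200_000):
    """Midpoint Riemann sum of f over [a, b] with n equal subintervals."""
    dx = (b - a) / n
    return sum(f(a + (i + 0.5) * dx) for i in range(n)) * dx


by_ftc = F(4) - F(1)
by_ftc_with_C = F(4, C=10.0) - F(1, C=10.0)
by_riemann = midpoint_sum(integrand, 1.0, 4.0)

print(by_ftc, by_ftc_with_C, by_riemann)
```

All three numbers agree to many decimal places: the constant C cancels, and the Riemann sum approaches the value given by the Fundamental Theorem.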


Example 2.4.12. Find the area under the graph of y = sin x and above the interval [0, π].

Figure 2.31: The area under part of the graph of y = sin x.


Solution:


This area is simply

\[ \int_0^{\pi} \sin x\,dx = -\cos x\,\Big|_0^{\pi} = -\cos\pi - (-\cos 0) = 1 + 1 = 2. \]


Example 2.4.13. Find the average value of

\[ f(x) = \frac{1}{1 + x^2} \]

on the interval [−1,1], and find all values of c in [−1,1] that are guaranteed to exist by the
Mean Value Theorem for Integration, Theorem 2.3.31, i.e., all values of c in [−1,1] such that
f(c) equals the average value of f on [−1, 1].


Solution: 


By definition (see Definition 2.3.30), the average value of f on [−1, 1] is equal to

\[ \frac{1}{1 - (-1)} \int_{-1}^{1} \frac{1}{1 + x^2}\,dx. \]

By the Fundamental Theorem and Theorem 1.1.13, we find that this equals

\[ \frac{1}{2}\tan^{-1} x\,\Big|_{-1}^{1} = \frac{1}{2}\left(\frac{\pi}{4} - \left(-\frac{\pi}{4}\right)\right) = \frac{\pi}{4}. \]

Figure 2.32: Equal areas under y = 1/(1 + x²) and y = π/4.


For what c in [−1,1] does f(c) = π/4? We solve

\[ \frac{1}{1 + c^2} = \frac{\pi}{4} \]

and find that we must have

\[ c^2 = \frac{4 - \pi}{\pi}, \]

i.e.,

\[ c = \pm\sqrt{\frac{4 - \pi}{\pi}} \approx \pm 0.5227232, \]

both of which are in the interval [−1, 1].
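The numbers in this example are easy to confirm with a few lines of code. This sketch uses the standard `math.atan` as the anti-derivative of 1/(1 + x²); the variable names are chosen here for illustration.

```python
import math


def f(x):
    return 1.0 / (1.0 + x * x)


a, b = -1.0, 1.0

# Mean value via the Fundamental Theorem, using arctan as an anti-derivative.
mean_value = (math.atan(b) - math.atan(a)) / (b - a)

# The positive solution of 1/(1 + c^2) = pi/4; the other solution is -c.
c = math.sqrt((4 - math.pi) / math.pi)

print(mean_value, math.pi / 4)
print(c, f(c), f(-c))
```

Both f(c) and f(−c) reproduce the mean value π/4, so f attains its mean value at two points of [−1, 1].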


Example 2.4.14. In Example 2.3.28, we discussed the fact that the definite integral

\[ \int_0^r \sqrt{r^2 - x^2}\,dx \]

yields one quarter of the area inside a circle of radius r.


Figure 2.33: Area under $y = \sqrt{r^2 - x^2}$.


Then, we used our knowledge from high school geometry about the area inside a circle of
radius r to conclude that

\[ \int_0^r \sqrt{r^2 - x^2}\,dx = \frac{\pi r^2}{4}. \]

However, now that we have the Fundamental Theorem, we can use integration/anti-differentiation
to verify that the area inside one quarter of a circle of radius r is πr²/4, i.e., that the area inside
a circle of radius r is πr². In other words, we can show that the formula from high school
geometry is correct.


In Example 1.2.6, we found (replacing a with r) that

\[ \int \sqrt{r^2 - x^2}\,dx = \frac{1}{2}\left[ x\sqrt{r^2 - x^2} + r^2 \sin^{-1}\left(\frac{x}{r}\right) \right] + C. \]


Thus, the Fundamental Theorem tells us

\[ \int_0^r \sqrt{r^2 - x^2}\,dx = \frac{1}{2}\left[ x\sqrt{r^2 - x^2} + r^2 \sin^{-1}\left(\frac{x}{r}\right) \right]\Big|_0^r = \]

\[ \frac{1}{2}\left( \left(0 + r^2\sin^{-1}(1)\right) - \left(0 + r^2\sin^{-1}(0)\right) \right) = \frac{1}{2}\cdot r^2\cdot\frac{\pi}{2} = \frac{\pi r^2}{4}. \]

Substitution in Definite Integrals:


We discussed the use of substitutions in finding anti-derivatives in Theorem 1.1.15. Now that
we have definite integration and the Fundamental Theorem, we can state a similar substitution
theorem for definite integrals.


Theorem 2.4.15. (Substitution in Definite Integrals) Suppose that g is continuously
differentiable on an open interval which contains the closed interval [a,b], and that f is
continuous on the closed interval between the minimum and maximum values of g on [a,b].
Then,

\[ \int_a^b f(g(x))\,g'(x)\,dx = \int_{g(a)}^{g(b)} f(u)\,du. \]


Proof. We will assume that g(a) ≤ g(b). The case where g(b) < g(a) is essentially identical.


As f is continuous on [g(a), g(b)], f possesses an anti-derivative F on [g(a), g(b)] by
Theorem 2.4.7, i.e., there is a function F, which is continuous on [g(a), g(b)], and such that, for all x
in (g(a), g(b)), F′(x) = f(x).


Since the composition of continuous functions is continuous, and, for all x in (a,b),

\[ (F \circ g)'(x) = F'(g(x)) \cdot g'(x), \]

it follows that F ∘ g is an anti-derivative of (f ∘ g)·g′ on [a,b]. The hypotheses guarantee that
f(g(x)) g′(x) is continuous on [a,b].


Therefore, we may apply the Fundamental Theorem, Theorem 2.4.10, to both sides of the
desired equality, and conclude that they are both equal to F(g(b)) − F(g(a)).


Example 2.4.16. Let’s calculate $\int_0^2 x e^{-x^2}\,dx$ in two slightly different-looking ways.


Our first method is to note that $xe^{-x^2}$ is continuous everywhere, and use the Fundamental
Theorem, Theorem 2.4.10, to calculate the value of the integral. We proceed as we did for indefinite
integrals in Section 1.1, except that we carry along the limits of integration throughout the
process.


Let u = −x², so that du = −2x dx. Then, x dx = −du/2, and we obtain

\[ \int_0^2 x e^{-x^2}\,dx = \int_{x=0}^{x=2} e^u(-du/2) = -\frac{1}{2}\int_{x=0}^{x=2} e^u\,du, \]

where we need to explicitly write x = 0 and x = 2 for the limits of integration, not just 0 and 2.


This is important: if we wrote simply $-\frac{1}{2}\int_0^2 e^u\,du$ for the integral, then the limits of
integration would be telling us values of u, and then the Fundamental Theorem would give us
−(e² − 1)/2 = (1 − e²)/2 for the value of the integral. This is wrong.


Proceeding correctly, we find

\[ -\frac{1}{2}\int_{x=0}^{x=2} e^u\,du = -\frac{1}{2}e^u\,\Big|_{x=0}^{x=2} = -\frac{1}{2}e^{-x^2}\,\Big|_0^2 = -\frac{1}{2}\left(e^{-4} - 1\right) = \frac{1 - e^{-4}}{2}. \]

Notice that, in the above process, all that we were really doing was calculating an anti-derivative,
as we did in Section 1.1, and carrying the limits of integration x = 0 and x = 2 throughout the
calculation.


What’s the other method? To apply Theorem 2.4.15, and actually change the limits of
integration to describe what u does. This saves us cumbersome notation, and means that we
don’t need to reinsert what u was, in terms of x, at the end of the calculation. Aside from that,
the process looks the same.


So, as before, let u = −x², so that du = −2x dx. Then, of course, x dx = −du/2, but, now,
we also note the values of u that we should use for the limits of integration in the u integral;
when x = 0, u = −0² = 0 and, when x = 2, u = −2² = −4. Therefore, we obtain

\[ \int_0^2 x e^{-x^2}\,dx = \int_0^{-4} e^u(-du/2) = -\frac{1}{2}\int_0^{-4} e^u\,du = -\frac{1}{2}e^u\,\Big|_0^{-4} = \frac{1 - e^{-4}}{2}. \]


Thus, we see that, when substituting into a definite integral, changing the limits of integration 
to describe what your substitution variable is doing saves some time — maybe not a lot of time, 


but some. 


However, there are some times when changing your limits of integration saves you a LOT
of time. Consider the integral

\[ \int_0^1 e^{(x^2 - x)^2}(2x - 1)\,dx. \]

Make the substitution u = x(x − 1) = x² − x, so that du = (2x − 1) dx. Then, the integral
becomes

\[ \int_{x=0}^{x=1} e^{u^2}\,du. \]

We cannot produce a “nice” anti-derivative of $e^{u^2}$: this function has no elementary
anti-derivative. See Remark 1.1.23. Nonetheless, had we changed our limits of integration to describe
what u does, we would have found that, when x = 0, u = 0, and when x = 1, u = 0; thus, our
integral becomes

\[ \int_0^0 e^{u^2}\,du = 0. \]

Therefore, in this example, switching to the u limits of integration enables us to calculate an
integral that we would not be able to calculate otherwise.
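Both integrals in this example can be checked against a direct numerical approximation. The sketch below uses a midpoint Riemann sum, a choice made here for illustration; the helper name `midpoint_sum` and the number of subintervals are assumptions, and the second integrand is taken to be e^((x² − x)²)(2x − 1), matching the substitution u = x² − x.

```python
import math


def midpoint_sum(f, a, b, n=100_000):
    """Midpoint Riemann sum of f over [a, b] with n equal subintervals."""
    dx = (b - a) / n
    return sum(f(a + (i + 0.5) * dx) for i in range(n)) * dx


# First integral: x * exp(-x^2) over [0, 2]; substitution gives (1 - e^(-4))/2.
first = midpoint_sum(lambda x: x * math.exp(-x * x), 0.0, 2.0)
first_exact = (1 - math.exp(-4)) / 2

# Second integral: exp((x^2 - x)^2) * (2x - 1) over [0, 1]; substitution gives 0.
second = midpoint_sum(lambda x: math.exp((x * x - x) ** 2) * (2 * x - 1), 0.0, 1.0)

print(first, first_exact)
print(second)
```

The first approximation matches (1 − e⁻⁴)/2, and the second is 0 up to rounding, as the changed limits of integration predict.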


2.4.1 Exercises 


In each of Exercises 1 through 15, calculate the definite integrals using the Funda- 
mental Theorem of Calculus. 


5 
1. ; ee+a2+2dz. 
3 


6 
Y 
2. —*— dy. 
oe g 


10 dw 


5 Vw2+25- 


3 
dv 
a eamaaaeees 7 
I, Ope" 
4 
J | cost(5y) ay 
0 
¥ = 
1 U 


10. a sint cost dt. 
0 


oo

16. Suppose we want to calculate $\int_{-1}^{1} \frac{dx}{x}$. Since ln|x| is an anti-derivative of the integrand,
the Fundamental Theorem of Calculus tells us that

\[ \int_{-1}^{1} \frac{dx}{x} = \ln|x|\,\Big|_{-1}^{1} = \ln 1 - \ln 1 = 0. \]

What is wrong with this argument?


fra 
17. Suppose that f is continuous on [a,b] and that G(x) = / f(t) dt. What is dG/dx? Hint: 


Use the Chain Rule. io 


In each of Exercises 18 through 22, find the average value of the function on the 


closed interval if it is defined. If it is undefined, explain why. 


18. k(x) = √(9 − x²), [2, 5].
19. j(y) = √(1 + y²), [0, 1].
20. h(z) = = [—2, 2]. 


21. g(u) =coshu, [—k,k], k > 0. 


22. f(v) = sec v, [0, π].


23. Find the area under the graph of y = 3e^x + 2 and above the interval [0, ln 7] on the x-axis.
Note that 3e^x + 2 > 0.

3
24. Find the area trapped between the graph y = y(x) = -24 ae and the interval [−1, 1]
x
on the x-axis. Note that, for x in [−1,1], y(x) is sometimes positive and sometimes
negative. You need to calculate the total area, both above and below the given interval.


Given a probability density function f(x), for a continuous random variable, X, the
probability that a ≤ X ≤ b is $\int_a^b f(x)\,dx$.


25. A continuous random variable is uniformly distributed if f(x) = C for some (necessarily
positive) constant. This means that all possible outcomes are equally likely to appear.
Suppose the life-span of a certain organism is uniformly distributed between 4 and 9 years
and the probability density function is f(x) = 1/5. What is the probability a given
organism will live for 6 to 8 years?


26. If I = [a,b] is the set of all possible outcomes of a random variable with density function
f(x), then it must be true that $\int_a^b f(x)\,dx = 1$. This means the probability the random
variable falls within the set of all its possible outcomes is 100%. Prove that a uniformly
distributed random variable on the interval [a, b] has density function f(x) = 1/(b − a).


27. A random variable has a range of possible outcomes between 1 and 4. ic) 


a. The density function is f(a) = cax?. What is c? 


b. What is the probability of an outcome between 2 and 3? 


The expected value or expectation of random variable X is a weighted average of
the possible values that the random variable can achieve and is denoted E(X). For
a discrete variable, $E(X) = \sum_{i=1}^{n} x_i\,p(x_i)$ where each x_i is a possible outcome that will
occur with probability p(x_i). For a continuous random variable that can achieve a
value on the interval [a,b], $E(X) = \int_a^b x f(x)\,dx$ where f(x) is the density function.


28. Suppose two standard six-sided dice are rolled and we're interested in the average, or expected, sum on the two dice. Thus, the random variable X can be any integer from 2 to 12.

a. Complete the table below by determining the probability of each possible outcome and entering it in the second column. Note that there are 36 possible outcomes: (1, 1), (1, 2), (1, 3), etc. For example, there are four ways to achieve a sum of five: (1, 4), (4, 1), (2, 3), (3, 2). The probability of a sum of five is therefore 4/36 = 1/9.
x_i | p(x_i) | x_i p(x_i)
2 | |
3 | |
4 | |
5 | 4/36 | 20/36
6 | |
7 | |
8 | |
9 | |
10 | |
11 | |
12 | |


b. What is the most likely sum? That is, what sum has the highest probability of 


occurring? 


c. Fill in the third column by multiplying the first two columns together. In row 5, for example, $x_i\, p(x_i) = 20/36 = 5/9$.


d. What is the sum of all the entries in the third column? This is the expected, or 


average sum. 


29. What is the expected value of a continuous random variable uniformly distributed over 
the interval [a, b]? 


30. Let $f(x) = \frac{3}{4}(1 - x^2)$.

a. Verify that this is a legitimate density function of a random variable with possible outcomes on the interval $[-1, 1]$ by verifying that $\int_{-1}^{1} f(x)\,dx = 1$.
b. What is E(X)?

More generally, we can define the expectation of a function of the random variable. For example, in the dice example above, we may have been interested in the square of the sum of the dice, rather than just the sum. The expectation of the square of the sum is $\sum x_i^2 \cdot p(x_i)$. In general, if g(X) is a function of a discrete variable, we write: $E(g(X)) = \sum g(x_i) \cdot p(x_i)$. If g(X) is a function of a continuous variable, we write: $E(g(X)) = \int_a^b g(x) \cdot p(x)\,dx$.


31. What is $E(X^2)$ in the dice example?
32. What is $E(X^2)$ for a uniformly distributed random variable on the interval [a, b]?

33. What is $E(X^2)$ for a random variable with domain $[-1, 1]$ and density function $f(x) = \frac{3}{4}(1 - x^2)$?
Variance measures the dispersion of a variable, or how spread out it is. Let $\mu = E(X)$. Then, the variance of a discrete variable is $\mathrm{Var}(X) = \sum_i (x_i - \mu)^2\, p(x_i)$. The variance of a continuous random variable is $\mathrm{Var}(X) = \int_a^b (x - \mu)^2 f(x)\,dx$.
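As a quick numerical illustration of the discrete variance formula, the sketch below computes the variance of a single fair die in two ways: directly from the definition, and as E(X²) − E(X)². The single die is an assumption made here for illustration (it is not one of the exercises), and a numerical agreement is of course not a proof of the identity.

```python
from fractions import Fraction

outcomes = range(1, 7)
p = Fraction(1, 6)  # each face of a fair die is equally likely

mu = sum(x * p for x in outcomes)  # E(X)
var_by_definition = sum((x - mu) ** 2 * p for x in outcomes)
var_by_shortcut = sum(x * x * p for x in outcomes) - mu ** 2

print(mu, var_by_definition, var_by_shortcut)
```

Both variance computations use exact rational arithmetic, so any difference between them would indicate a genuine error rather than rounding.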


34. Show that for a discrete variable, $\mathrm{Var}(X) = E(X^2) - E(X)^2$.

35. Show that for a continuous variable, $\mathrm{Var}(X) = E(X^2) - E(X)^2$. Hint: the proof is nearly identical to the previous problem. This shows that the rules governing finite sums are in some sense not that different from the rules governing definite integrals.

36. What is the variance of the sum of two dice?

37. What is the variance of a uniformly distributed random variable on the interval [a, b]?

38. What is the variance of a random variable with density function $f(x) = \frac{3}{4}(1 - x^2)$?

39. Let a and b be positive integers. Prove that $\int_{-\pi}^{\pi} \sin ax \cos bx\,dx = 0$. Hint: prove that $\sin ax \cos bx = \frac{1}{2}\left(\sin(ax + bx) + \sin(ax - bx)\right)$ and use the fact that $\cos x$ is an even function.

40. Let a and b be positive integers. Evaluate $\int_{-\pi}^{\pi} \sin ax \sin bx\,dx$ when
a. $a \ne b$;
b. $a = b$.


Use the Fundamental Theorem of Calculus and the Chain Rule to solve Exercises 
41 through 44. 


Al 


A2. 


43. 


AA. 


45. 


A6. 


d 32 
. What is =| 3t dt? 
dx Jo 
d 5a 
What is — t? dt? 
22 


ae 
What is =| 1du? 
dx J, 


2 
d zx 
What is =f xt dt? 
dx ) 
a2 
Let g(x) = fy e*** dt. What is g'(2)? Hint: move e* outside the integral. 
Let a and b be positive integers. Evaluate $\int_{-\pi}^{\pi} \cos(ax)\cos(bx)\,dx$ when
a. $a \ne b$;
b. $a = b$.
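47. Let $A = \int_0^{\pi/2} \frac{\sin^k x}{\sin^k x + \cos^k x}\,dx$.

a. Show that $A = \int_0^{\pi/2} \frac{\cos^k\left(\frac{\pi}{2} - x\right)}{\cos^k\left(\frac{\pi}{2} - x\right) + \sin^k\left(\frac{\pi}{2} - x\right)}\,dx$.
b. Make the substitution $u = \frac{\pi}{2} - x$ and show that $A = \int_0^{\pi/2} \frac{\cos^k u}{\sin^k u + \cos^k u}\,du$. Let the right-hand side of this equation be B.
c. Prove that $A + B = \pi/2$ and conclude that $A = \pi/4$.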


48. Consider the expression $\int_2^4 \int_1^5 (x^2 y + 2y)\,dx\,dy$. We evaluate by first viewing y as a constant and integrating with respect to x, and then integrating with respect to y.

a. Evaluate $\int_1^5 (x^2 y + 2y)\,dx$. Assume y is a constant.
b. Take the definite integral of your answer to part (a) with respect to y.
c. Do you get the same answer if you switch the order of integration? That is, is your answer to part (b) the same as $\int_1^5 \int_2^4 (x^2 y + 2y)\,dy\,dx$?


Use the method in the previous problem to evaluate the integrals in the next two 


exercises. 


49. a. $\int_0^{2\pi} \int_0^1 r\,dr\,d\theta$.
b. $\int_0^1 \int_0^{2\pi} r\,d\theta\,dr$.

a | i sin x cos y dx dy. 
o Jo 

b. f 7 sin x cos y dy dx. 
o Jo 


It’s not always true that switching the order of integration results in the same value. However, 
if a function is reasonably well-behaved, then switching the order of integration has no effect on 
the final outcome. This is the content of Fubini’s Theorem. 
2.5 Improper Integrals 


In this section, we will define the integral of a function over a set of real numbers, where the set of real numbers need not be a closed, bounded interval; we also define integrals when the value of the integrand is unbounded, even if the interval of integration is itself bounded. This really requires a new definition, for Theorem 2.3.6 tells us that the Riemann integral of an unbounded function does not exist.


The actual calculation of our new type of integral will involve calculating our usual integrals 
on intervals of the form [a,b], where now either a or b varies, and we will then take a limit 
as a or b approaches some “problematic” value. Of course, the calculation of the integrals on 
the intervals [a, b] can/will still use the Fundamental Theorem of Calculus, and, hence, we may 
apply all of our techniques from earlier sections to find anti-derivatives. 


As we wish to be able to discuss integrals over intervals [a, b] and over intervals (a, b] (and over other sets), the notation $\int_a^b$ does not suffice to distinguish between the types of intervals that we care about. Thus, we adopt some new notation; if E is a subset of the real numbers, we will write

$$\int_E f(x)\,dx$$

for the integral of f over the set E. Of course, right now, this has no meaning for us, unless E is a closed interval [a, b], in which case $\int_a^b f(x)\,dx = \int_{[a,b]} f(x)\,dx$. Our goal in this section is to define $\int_E f(x)\,dx$ for sets E that need not be intervals.


Before we really start looking at new types of integrals, it will be helpful to have a new piece 
of terminology. 


Definition 2.5.1. If f is a real function, with domain D, an extension of f is a function $\hat f$ whose domain contains D, and such that f and $\hat f$ agree at all points of D, i.e., for all x in D, $\hat f(x) = f(x)$.

In other words, an extension $\hat f$ of f is a function whose domain includes the domain, D, of f, and such that the restriction of $\hat f$ to D is equal to f.
Example 2.5.2. For example, the function $\hat f$, with domain [0, 1], given by

$$\hat f(x) = \begin{cases} \dfrac{\sin x}{x} & \text{if } 0 < x \le 1; \\ 1 & \text{if } x = 0 \end{cases}$$

is an extension of the function $f(x) = (\sin x)/x$, with domain (0, 1]. In fact, $\hat f$ is a continuous extension of f to [0, 1], since

$$\lim_{x \to 0^+} \frac{\sin x}{x} = 1 = \hat f(0).$$


Now, let’s start our discussion of more general notions of integration by looking at the easiest 


case, one where we already know the answer: the case of a function which is Riemann integrable. 


Example 2.5.3. Consider $f(x) = x^2$ on the interval [0, 1]. Certainly, f is Riemann integrable on [0, 1], since f is continuous on [0, 1]. Thus, we know what $\int_{[0,1]} x^2\,dx$ means; it means $\int_0^1 x^2\,dx$, which, by the Fundamental Theorem of Calculus, is equal to $(x^3/3)\big|_0^1 = 1/3$.

What if we omit the 0 from the interval of integration, i.e., what should $\int_{(0,1]} x^2\,dx$ mean? Theorem 2.3.14 tells us that, for Riemann integrals on closed intervals, altering the function at a finite number of points does not change the integrability of the function or the value of the integral. Thus, intuitively, it seems reasonable that omitting a single point of integration, like 0, should not affect the integral. Therefore, in this case, it seems reasonable to make the definition that

$$\int_{(0,1]} x^2\,dx = \int_{[0,1]} x^2\,dx = \frac{1}{3}.$$


While this seems reasonable, it should, in a way, seem like "cheating"; we wanted to integrate $f(x) = x^2$ on the interval (0, 1], and yet we used information about f on the larger interval [0, 1], i.e., we used that there was a continuous extension of f from the interval (0, 1] to the interval [0, 1]. Could we have defined $\int_{(0,1]} x^2\,dx$ without using the extension to [0, 1]? Yes.

For all a such that $0 < a \le 1$, $f(x) = x^2$ is Riemann integrable on the interval [a, 1], and

$$\int_a^1 x^2\,dx = \frac{x^3}{3}\bigg|_a^1 = \frac{1}{3} - \frac{a^3}{3},$$

and this integral uses only that $f(x) = x^2$ is defined on (0, 1].

Now, we can take the limit as a approaches 0 from the right to obtain our previous answer. That is, we could have defined $\int_{(0,1]} x^2\,dx$ by

$$\int_{(0,1]} x^2\,dx = \lim_{a \to 0^+} \int_a^1 x^2\,dx = \lim_{a \to 0^+} \left(\frac{1}{3} - \frac{a^3}{3}\right) = \frac{1}{3}.$$

The point is that, in this example, we obtain the same value for $\int_{(0,1]} x^2\,dx$, regardless of whether we use the fact that $x^2$ extends to [0, 1] or whether we instead use the limits of integrals, as our lower limit of integration approaches 0.


Example 2.5.4. Let's consider a more-complicated example. Let f be the function, with domain (0, 1], given by

$$f(x) = \frac{\sin x}{x}.$$

How many choices do we have for reasonable ways to define the integral $\int_{(0,1]} f(x)\,dx$? At least two.


First, we can define an extension $\hat f$ to the closed interval [0, 1], and then define

$$\int_{(0,1]} f(x)\,dx = \int_{[0,1]} \hat f(x)\,dx.$$

Of course, we either need to pick a particular extension, or show that the value of $\int_{[0,1]} \hat f(x)\,dx$ is always the same, regardless of what extension we select. We could, in fact, take $\hat f$ to be the continuous extension of f given in Example 2.5.2; then, $\hat f$ would certainly be Riemann integrable by Theorem 2.3.8.

However, we don't have to use a continuous extension of f. Since $(\sin x)/x$ does possess a continuous extension to the interval [0, 1], it follows that $(\sin x)/x$ is bounded on (0, 1]. Thus, Theorem 2.3.8 and Theorem 2.3.14 imply that it doesn't matter how we define $\hat f$ at x = 0; no matter what value we choose for $\hat f(0)$, $\hat f$ will be bounded and, at least, piecewise-continuous (hence, Riemann integrable), and changing the value at one point will not affect the value of the integral. Therefore, we could define

$$\int_{(0,1]} f(x)\,dx = \int_{[0,1]} \hat f(x)\,dx,$$

for any extension of f to a function $\hat f$ on [0, 1].


However, we could take the second approach, as we did in Example 2.5.3; without using an extension at all, we could define

$$\int_{(0,1]} \frac{\sin x}{x}\,dx = \lim_{a \to 0^+} \int_a^1 \frac{\sin x}{x}\,dx.$$

This leads to a new question or two. If $0 < a \le 1$, we know that $(\sin x)/x$ is continuous on [a, 1] and, hence, $\int_a^1 \frac{\sin x}{x}\,dx$ exists. But, how do we know that the limit as $a \to 0^+$ exists and, even if the limit does exist, how do we know that that limit is equal to what we'd get by extending $(\sin x)/x$ to [0, 1] and then taking the Riemann integral of our extension over [0, 1]?


Actually, the answers to these questions are easy, given our earlier results. Suppose that $\hat f$ is an extension of f to [0, 1]. Then, as we discussed above, $\hat f$ is Riemann integrable on [0, 1]. By an earlier remark, the function on [0, 1] that sends a to $\int_a^1 \hat f(x)\,dx$ is continuous; in particular,

$$\int_0^1 \hat f(x)\,dx = \lim_{a \to 0^+} \int_a^1 \hat f(x)\,dx = \lim_{a \to 0^+} \int_a^1 f(x)\,dx,$$

where the second equality above follows from the fact that, if $0 < x \le 1$, then $\hat f(x) = f(x)$, since $\hat f$ is an extension of f.

Thus, $\lim_{a \to 0^+} \int_a^1 f(x)\,dx$ exists and equals what we would obtain by calculating the Riemann integral of any extension of f to [0, 1]. Therefore, it seems reasonable to define the integral of f over the half-open interval (0, 1] by

$$\int_{(0,1]} f(x)\,dx = \lim_{a \to 0^+} \int_a^1 f(x)\,dx.$$
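The limit in Example 2.5.4 can be watched numerically. The sketch below approximates the Riemann integrals of (sin x)/x over [a, 1] with a midpoint rule for a few values of a approaching 0 from the right. The helper names and the choice of sample values are assumptions made for this illustration; the printed numbers are approximations, and they should increase toward roughly 0.946.

```python
import math

def midpoint_integral(f, a, b, n=20_000):
    """Midpoint-rule approximation of the Riemann integral of f over [a, b]."""
    dx = (b - a) / n
    return sum(f(a + (i + 0.5) * dx) for i in range(n)) * dx

def integrand(x):
    """(sin x)/x; only evaluated at points x > 0 below."""
    return math.sin(x) / x

for a in [0.5, 0.1, 0.01, 0.001]:
    print(a, midpoint_integral(integrand, a, 1.0))
```

Note that the code never evaluates the integrand at x = 0, which mirrors the point of the example: the limit uses only the values of f on (0, 1].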


What we have seen in the previous two examples is that, in those cases, it made sense to define $\int_{(0,1]} f(x)\,dx$ by

$$\int_{(0,1]} f(x)\,dx = \lim_{a \to 0^+} \int_a^1 f(x)\,dx,$$

where each $\int_a^1 f(x)\,dx$ is a Riemann integral. However, in those cases, we could also have defined $\int_{(0,1]} f(x)\,dx$ by extending f to the closed interval [0, 1] and then using the Riemann integral of the extended function. Now we will look at an example where the approach via extensions does not work, but the limit idea still yields a meaningful result.


Example 2.5.5. Consider the function

$$f(x) = \frac{1}{\sqrt{x}}$$

on the half-open interval (0, 1]. As x approaches 0 from the right, $1/\sqrt{x}$ approaches ∞, and so f is unbounded on (0, 1]. Hence, any extension of f to the closed interval [0, 1] will also be unbounded and, therefore, will not be Riemann integrable (Theorem 2.3.6).

On the other hand, using the Fundamental Theorem, and the Power Rule for Integration, we find

$$\lim_{a \to 0^+} \int_a^1 x^{-1/2}\,dx = \lim_{a \to 0^+} \left(\frac{x^{1/2}}{1/2}\bigg|_a^1\right) = \lim_{a \to 0^+} \left(2 - 2\sqrt{a}\right) = 2.$$

This means that the area under the graph of $y = 1/\sqrt{x}$ and over the interval [a, 1] approaches 2 as $a \to 0^+$. We say, simply, that the area under the graph of $y = 1/\sqrt{x}$ and over the interval (0, 1] equals 2.

Figure 2.34: Area under the graph of $y = 1/\sqrt{x}$ over [a, 1].
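To see the behaviour in Example 2.5.5 concretely, the short sketch below tabulates, for a few values of a, the height of the graph at x = a together with the exact area over [a, 1] given by the anti-derivative 2√x. The function name and the sample values of a are assumptions made for this illustration.

```python
import math

def area_over_a_to_1(a):
    """Exact area under y = 1/sqrt(x) over [a, 1]: 2*sqrt(1) - 2*sqrt(a)."""
    return 2.0 - 2.0 * math.sqrt(a)

for a in [0.25, 0.01, 1e-4, 1e-8]:
    # The height 1/sqrt(a) grows without bound, yet the area stays below 2.
    print(a, 1.0 / math.sqrt(a), area_over_a_to_1(a))
```

The table makes the contrast visible: the integrand is unbounded near 0, but the areas over [a, 1] still settle toward 2.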


What we see in this example is that defining the integral over a half-open interval (a, b] in terms of limits of integrals over closed intervals gives us a well-defined number, while the Riemann integral of an extension to the closed interval [a, b] does not exist.
Before making a definition, we wish to look at one more example.

Example 2.5.6. Suppose that $f(x) = e^{-x}$ and we wish to integrate f over the interval [0, ∞). Note that $-e^{-x}$ is an anti-derivative of $e^{-x}$.

In this example, the function f itself is bounded on the given interval; $e^{-x}$ is between 0 and 1 for x in [0, ∞). On the other hand, the interval that we want to integrate over is unbounded, since it "goes out to infinity". How should we define

$$\int_{[0,\infty)} e^{-x}\,dx?$$

Here, there is no closed interval to which we can possibly extend f. However, the limit approach still yields an answer:

$$\lim_{b \to \infty} \int_0^b e^{-x}\,dx = \lim_{b \to \infty} \left(-e^{-x}\Big|_0^b\right) = \lim_{b \to \infty} \left(-e^{-b} + 1\right) = 1.$$

This means that the area under the graph of $y = e^{-x}$ and over the interval [0, b] approaches 1 as $b \to \infty$. We say, simply, that the area under the graph of $y = e^{-x}$ and over the interval [0, ∞) equals 1.

Figure 2.35: Area under the graph of $y = e^{-x}$ over [0, b].
In light of the above discussion and examples, we make the following definition:

Definition 2.5.7. Suppose that f is defined on the half-open interval [a, b), where b may be ∞, and suppose that, for all c such that $a \le c < b$, f is Riemann integrable on the closed interval [a, c]. Then, we let

$$\int_a^b f(x)\,dx = \int_{[a,b)} f(x)\,dx = \lim_{c \to b^-} \int_a^c f(x)\,dx,$$

provided that the limit exists, in which case we say that f is integrable on [a, b), or that the integral $\int_a^b f(x)\,dx$ converges. Otherwise, we say that $\int_a^b f(x)\,dx$ diverges.

Similarly, suppose that f is defined on the half-open interval (a, b], where a may be −∞, and suppose that, for all c such that $a < c \le b$, f is Riemann integrable on the closed interval [c, b]. Then, we let

$$\int_a^b f(x)\,dx = \int_{(a,b]} f(x)\,dx = \lim_{c \to a^+} \int_c^b f(x)\,dx,$$

provided that the limit exists, in which case we say that f is integrable on (a, b], or that the integral $\int_a^b f(x)\,dx$ converges. Otherwise, we say that $\int_a^b f(x)\,dx$ diverges.

Naturally, if $\int_a^b f(x)\,dx$ converges, we define $\int_b^a f(x)\,dx = -\int_a^b f(x)\,dx$.

Note that we do not have a notational conflict; if f is, in fact, Riemann integrable on [a, b], then, by an earlier theorem and remark, the Riemann integral $\int_a^b f(x)\,dx$ equals both of the one-sided limits of integrals given above.
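The limit in Definition 2.5.7 can be explored with a rough numerical sketch. The code below is only an illustration under stated assumptions: the function names, the particular sequences of c values, and the two sample integrands are choices made here, and the midpoint rule is crude near a point where the integrand blows up. It returns the approximate integrals over [a, c] for c moving toward b, so that the trend (settling down or not) can be inspected.

```python
import math

def midpoint_integral(f, a, b, n=20_000):
    """Midpoint-rule approximation of the Riemann integral of f over [a, b]."""
    dx = (b - a) / n
    return sum(f(a + (i + 0.5) * dx) for i in range(n)) * dx

def integrals_toward_right_endpoint(f, a, b, steps=6):
    """Approximate the integrals over [a, c] for c approaching b from the left.

    If b is infinite, c runs through a + 10, a + 20, ...;
    otherwise c = b - 10**(-k) for k = 1, ..., steps.
    """
    values = []
    for k in range(1, steps + 1):
        c = a + 10.0 * k if math.isinf(b) else b - 10.0 ** (-k)
        values.append(midpoint_integral(f, a, c))
    return values

# Unbounded interval: e^(-x) on [0, infinity); the values approach 1.
print(integrals_toward_right_endpoint(lambda x: math.exp(-x), 0.0, math.inf))

# Unbounded integrand: 1/sqrt(1 - x) on [0, 1); the values creep toward 2.
print(integrals_toward_right_endpoint(lambda x: 1.0 / math.sqrt(1.0 - x), 0.0, 1.0))
```

A numerical trend like this can suggest convergence, but it cannot replace the limit computation in the definition.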


The way that we usually conclude that f is Riemann integrable on all of the closed intervals [a, c] (or [c, b]) contained in [a, b) (or (a, b]) is that f is continuous on the entire half-open interval. Assuming this is the case, the only way that the integral $\int_a^b f(x)\,dx$ can possibly fail to converge is for either the interval of integration to be unbounded, or for the function f to be unbounded on the interval of integration. We give these two types of integrals which involve unbounded activity a name:

Definition 2.5.8. An integral $\int_a^b f(x)\,dx$, in which a or b is ±∞, or such that f is unbounded on the interval (a, b), is called an improper integral.
The integrals in Example 2.5.5 and Example 2.5.6 are improper integrals, and yet the integrals converge. The importance of improper integrals is that, for continuous functions, they're the only types of integrals which might diverge.

While the integrals in Definition 2.5.7 are the basic new types of integrals that we are defining in this section, we are also interested in more-complicated integrals, ones which break up into a finite number of pieces which are of the types found in Definition 2.5.7:


Example 2.5.9. Consider the integral

$$\int_{-1}^{8} \frac{1}{x^{2/3}}\,dx.$$

The point x = 0 is in the interval [−1, 8], the interval over which we're supposed to integrate. However, as x approaches 0 from the left or right, the integrand goes to ∞. This is a type of improper integral. The question is: how should we define what such an integral means?

Figure 2.36: The graph of $y = x^{-2/3}$ becomes unbounded on either side of x = 0.

The answer is: we want Theorem 2.3.16, on splitting up integrals, to remain true. This means that we want it to be true that

$$\int_{-1}^{8} \frac{1}{x^{2/3}}\,dx = \int_{-1}^{0} \frac{1}{x^{2/3}}\,dx + \int_{0}^{8} \frac{1}{x^{2/3}}\,dx,$$

where each of the summands on the right is an integral of the type we defined in Definition 2.5.7. We now calculate by taking limits:

$$\lim_{b \to 0^-} \int_{-1}^{b} x^{-2/3}\,dx + \lim_{a \to 0^+} \int_{a}^{8} x^{-2/3}\,dx = \lim_{b \to 0^-} \left(3x^{1/3}\Big|_{-1}^{b}\right) + \lim_{a \to 0^+} \left(3x^{1/3}\Big|_{a}^{8}\right) = 3 + 6 = 9.$$
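The two one-sided limits in Example 2.5.9 can be checked with the anti-derivative $3x^{1/3}$. In the sketch below, the helper names are assumptions made for illustration, and a real cube root is built by hand because raising a negative float to the power 1/3 in Python does not give the real root. The two pieces are kept separate, since the definition takes each limit on its own.

```python
import math

def cube_root(x):
    """Real cube root, valid for negative x as well."""
    return math.copysign(abs(x) ** (1.0 / 3.0), x)

def antiderivative(x):
    """3 * x^(1/3), an anti-derivative of x^(-2/3) on each side of 0."""
    return 3.0 * cube_root(x)

def pieces(eps):
    """Integral over [-1, -eps] and integral over [eps, 8], as a pair."""
    left = antiderivative(-eps) - antiderivative(-1.0)
    right = antiderivative(8.0) - antiderivative(eps)
    return left, right

for eps in [1e-1, 1e-3, 1e-6, 1e-9]:
    left, right = pieces(eps)
    print(eps, left, right, left + right)
```

As eps shrinks, the left piece should approach 3 and the right piece 6, in agreement with the total of 9.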


There can also be multiple “problem points”. 


Example 2.5.10. Consider the integral

$$\int_1^\infty \frac{1}{(x-2)(x-4)}\,dx.$$

After possibly removing a finite number of points (problem points, where unboundedness comes into play), we want to split the interval [1, ∞) into a finite number of closed or half-open intervals on which the integrand 1/[(x − 2)(x − 4)] is continuous; in this splitting, we allow a pair of closed or half-open intervals to intersect each other in, at most, one point. We then add together the resulting integrals, provided all of them exist; otherwise, we say that the original integral diverges.

Thus, we start with the interval [1, ∞). We remove the two points x = 2 and x = 4, where 1/[(x − 2)(x − 4)] is undefined. For now, this gives us intervals [1, 2), (2, 4), and (4, ∞). The intervals (2, 4) and (4, ∞) are not closed or half-open. Why do we care? We do not want to have to deal with some sort of simultaneous limits at the two endpoints of the intervals.

Therefore, we further split the intervals (2, 4) and (4, ∞). Where do we split them? At any point in-between the endpoints. It is a theorem, which we incorporated into the statement of Definition 2.5.11, that it doesn't matter where the splitting occurs. So, we split the interval (2, 4) into (2, 2.5] and [2.5, 4) (pairs of half-open intervals may intersect at a point), and we split (4, ∞) into (4, 7] and [7, ∞).

We end up with five half-open intervals: $I_1 = [1, 2)$, $I_2 = (2, 2.5]$, $I_3 = [2.5, 4)$, $I_4 = (4, 7]$, and $I_5 = [7, \infty)$, whose union is equal to the original interval [1, ∞), minus a finite number of points, and pairs of the half-open intervals intersect each other in, at most, one point.
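Now, we define

$$\int_1^\infty \frac{1}{(x-2)(x-4)}\,dx$$

to equal the sum

$$\int_{I_1} \frac{1}{(x-2)(x-4)}\,dx + \int_{I_2} \frac{1}{(x-2)(x-4)}\,dx + \cdots + \int_{I_5} \frac{1}{(x-2)(x-4)}\,dx,$$

provided that all of these integrals exist, in which case we say that $\int_1^\infty \frac{1}{(x-2)(x-4)}\,dx$ converges. If one (or more) of the five separate integrals, appearing in the summation, diverges, then we say that $\int_1^\infty \frac{1}{(x-2)(x-4)}\,dx$ diverges.

In the remainder of this example, we will show that

$$\int_1^\infty \frac{1}{(x-2)(x-4)}\,dx$$

diverges, by showing that the first improper integral in the summation above diverges, i.e., we will show that

$$\int_{I_1} \frac{1}{(x-2)(x-4)}\,dx = \int_1^2 \frac{1}{(x-2)(x-4)}\,dx = \lim_{b \to 2^-} \int_1^b \frac{1}{(x-2)(x-4)}\,dx$$

diverges to ∞.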


We find an anti-derivative of $\frac{1}{(x-2)(x-4)}$ via partial fractions, as in Section 1.3. So, we first determine constants A and B such that

$$\frac{1}{(x-2)(x-4)} = \frac{A}{x-2} + \frac{B}{x-4}$$

for all x, other than x = 2 and x = 4. Clearing the denominators, by multiplying each side of the equality by the big denominator on the left, i.e., by (x − 2)(x − 4), we obtain

$$1 = A(x-4) + B(x-2),$$

which needs to hold for all x. Plugging in x = 2, we find that $1 = A \cdot (-2) + B \cdot 0$, so that A = −1/2. Plugging in x = 4, we find that $1 = A \cdot 0 + B \cdot 2$, and so B = 1/2. Thus,

$$\frac{1}{(x-2)(x-4)} = \frac{-1/2}{x-2} + \frac{1/2}{x-4},$$


and we want to calculate

$$\lim_{b \to 2^-} \int_1^b \frac{1}{(x-2)(x-4)}\,dx = \lim_{b \to 2^-} \int_1^b \left(\frac{-1/2}{x-2} + \frac{1/2}{x-4}\right) dx, \qquad (2.3)$$

or show that it doesn't exist. We need to find an anti-derivative of 1/(x − a), where a is 2 or 4.

We accomplish this by making the substitution u = x − a, so that du = dx, and

$$\int \frac{1}{x-a}\,dx = \int \frac{1}{u}\,du = \ln|u| + C = \ln|x-a| + C.$$
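The partial fractions decomposition with A = −1/2 and B = 1/2, and the resulting anti-derivative, can be sanity-checked in a few lines. In this sketch, the function names, the sample points, and the finite-difference step are assumptions made for illustration. The first check uses exact rational arithmetic; the second compares a central-difference estimate of the derivative of the anti-derivative with the original integrand.

```python
import math
from fractions import Fraction

def original(x):
    return 1 / ((x - 2) * (x - 4))

def decomposed(x):
    return Fraction(-1, 2) / (x - 2) + Fraction(1, 2) / (x - 4)

def antiderivative(x):
    """(-1/2) ln|x - 2| + (1/2) ln|x - 4|, valid away from x = 2 and x = 4."""
    return -0.5 * math.log(abs(x - 2.0)) + 0.5 * math.log(abs(x - 4.0))

def derivative_estimate(x, h=1e-6):
    """Central-difference estimate of the derivative of the anti-derivative."""
    return (antiderivative(x + h) - antiderivative(x - h)) / (2.0 * h)

# Exact check at rational sample points, skipping x = 2 and x = 4.
samples = [Fraction(n, 3) for n in range(-9, 30) if Fraction(n, 3) not in (2, 4)]
print(all(original(x) == decomposed(x) for x in samples))

# Approximate check that the anti-derivative differentiates back to the integrand.
for x in (1.5, 3.0, 5.0):
    print(x, derivative_estimate(x), original(x))
```

Agreement at sample points is a check, not a proof, but it catches sign errors in A and B quickly.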


Thus, Formula 2.3 becomes

$$\lim_{b \to 2^-} \int_1^b \frac{1}{(x-2)(x-4)}\,dx = \lim_{b \to 2^-} \left[\frac{-1}{2}\ln|x-2| + \frac{1}{2}\ln|x-4|\right]_1^b.$$

As b is approaching 2 from the left, we know that all of the x's that we are considering are less than 2 and, hence, |x − 2| = 2 − x and |x − 4| = 4 − x. Now, evaluating at x = b and subtracting the value at x = 1, we find

$$\lim_{b \to 2^-} \left[-\frac{1}{2}\ln(2-b) + \frac{1}{2}\ln(4-b) - \frac{1}{2}\ln 3\right],$$

where we used that ln 1 = 0. As b approaches 2 from the left, $\frac{1}{2}\ln(4-b) - \frac{1}{2}\ln 3$ approaches $\frac{1}{2}\ln 2 - \frac{1}{2}\ln 3$, but $-\frac{1}{2}\ln(2-b)$ approaches $(-1/2)(-\infty) = \infty$.

Therefore, $\int_1^2 \frac{1}{(x-2)(x-4)}\,dx$ diverges, and so does the integral over the bigger interval [1, ∞).

We should mention that, just because $\int_1^2 \frac{1}{(x-2)(x-4)}\,dx$ diverges to ∞, that does not imply that $\int_1^\infty \frac{1}{(x-2)(x-4)}\,dx$ also diverges to ∞; other parts of the sum that we split the integral into may (and do) diverge to −∞. Thus, we simply say that $\int_1^\infty \frac{1}{(x-2)(x-4)}\,dx$ diverges, without trying to specify the manner in which it diverges.
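The divergence of the first piece in Example 2.5.10 can be seen by evaluating the closed-form value of the integral over [1, b] for b closer and closer to 2. The function name and the sample values of b in the sketch below are assumptions made for illustration; the formula is the one obtained from the partial fractions anti-derivative, using ln 1 = 0.

```python
import math

def first_piece(b):
    """Value of the integral of 1/((x-2)(x-4)) over [1, b], for 1 <= b < 2."""
    return -0.5 * math.log(2.0 - b) + 0.5 * math.log(4.0 - b) - 0.5 * math.log(3.0)

for k in range(1, 8):
    b = 2.0 - 10.0 ** (-k)
    # Each extra decimal digit of closeness to 2 adds about (1/2) ln 10 to the value.
    print(b, first_piece(b))
```

The values keep growing by a roughly constant amount at each step, which is the numerical signature of logarithmic divergence to ∞.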
We summarize our discussion and example above into a definition. In this definition, we will split a set into a union of closed or half-open intervals (as we did above); the point is that each of the intervals that we split things into has a "problem", an unboundedness issue, at, at most, one endpoint. It is important that the definition gives you the same result, regardless of how
you pick the half-open intervals. The proof of this is in Theorem 


Definition 2.5.11. Let E be a subset of the real numbers which is the union of a finite number of intervals. Let f be a real function, which is defined and continuous on the set E except, perhaps, at a finite set of points P. Let E − P denote the set of points in E which are not in the set P.

Then, E − P is a union of a finite number of closed or half-open intervals $I_1, I_2, \ldots, I_n$ on which f is defined and continuous. Given any such decomposition of E − P into intervals, we define the integral of f on E by

$$\int_E f(x)\,dx = \int_{I_1} f(x)\,dx + \int_{I_2} f(x)\,dx + \cdots + \int_{I_n} f(x)\,dx,$$

provided that each integral in the summation on the right converges; in this case, we say that $\int_E f(x)\,dx$ converges or that f is integrable on E. If any one of the integrals in the summation above diverges, then we say that $\int_E f(x)\,dx$ diverges.

If E is an interval [a, b], [a, b), (a, b], or (a, b), then we may also use other notation; we set

$$\int_a^b f(x)\,dx = \int_E f(x)\,dx, \quad \text{and} \quad \int_b^a f(x)\,dx = -\int_E f(x)\,dx.$$


Note that, with our terminology, we may say that a function f is integrable on a set E, even if f is not defined at a finite number of the points in E. For instance, in Example 2.5.9, we would say that $1/x^{2/3}$ is integrable on [−1, 8], even though $1/x^{2/3}$ is not defined at x = 0. Be aware that other books might use different terminology, and might not say that f is integrable on a set on which f is not defined.
Just as we have linearity for Riemann integrals, Theorem 2.3.19, we also have linearity for our more general integrals, provided the individual integrals converge. The proof is essentially identical, except that, to deal with the improper integrals, you must use that limits are "linear", i.e., you can pull constants out of limits and split up sums.

Theorem 2.5.12. (Linearity of Improper Integrals) Let E be a subset of the real numbers which is the union of a finite number of intervals. Suppose that f and g are integrable on E, and that a and b are any real numbers. Then, af + bg is integrable on E and

$$\int_E \left(a f(x) + b g(x)\right) dx = a \int_E f(x)\,dx + b \int_E g(x)\,dx.$$


Example 2.5.13. As we saw in Example 2.5.5 and Example 2.5.9 (with different upper limits of integration), the integrals

$$\int_0^1 \frac{1}{x^{1/2}}\,dx \quad \text{and} \quad \int_0^1 \frac{1}{x^{2/3}}\,dx$$

both converge; we leave it as an exercise for you to show that they converge to 2 and 3, respectively.

Therefore,

$$\int_0^1 \left(\frac{5}{x^{1/2}} - \frac{7}{x^{2/3}}\right) dx$$

converges, and equals $5 \cdot 2 - 7 \cdot 3 = -11$.
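The value −11 in Example 2.5.13 can be cross-checked without appealing to linearity, by integrating the combined integrand over [a, 1] directly and letting a shrink. The sketch below assumes the anti-derivative $10x^{1/2} - 21x^{1/3}$ of $5x^{-1/2} - 7x^{-2/3}$; the function names and the sample values of a are illustrative choices.

```python
def combined_antiderivative(x):
    """10*x^(1/2) - 21*x^(1/3), an anti-derivative of 5*x^(-1/2) - 7*x^(-2/3) for x > 0."""
    return 10.0 * x ** 0.5 - 21.0 * x ** (1.0 / 3.0)

def combined_integral(a):
    """Integral of 5/x^(1/2) - 7/x^(2/3) over [a, 1], for 0 < a <= 1."""
    return combined_antiderivative(1.0) - combined_antiderivative(a)

for a in [1e-3, 1e-6, 1e-9, 1e-15]:
    print(a, combined_integral(a))
```

The printed values should drift toward −11 as a approaches 0 from the right, matching 5·2 − 7·3.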


It is sometimes possible to tell that an improper integral converges, without being able to 
determine what it converges to. This seemingly bizarre fact stems from a defining property 
of the real numbers: every non-empty set of real numbers, which has an upper bound, has a 
LEAST upper bound. See Definition 5.1.15 and Theorem 5.1.18. This least upper bound property
is the main reason that we can sometimes tell that limits, as in improper integrals, exist without 


knowing the value. 


We first give a theorem which says that there’s only one way for some types of improper 


integrals to diverge. 




Theorem 2.5.14. Let I be an interval of the form [a, b) or (a, b], where we allow the intervals [a, ∞) or (−∞, b]. Suppose that, for all x in I, $f(x) \ge 0$, and that, for all closed intervals [c, d] contained in I, f is Riemann integrable on [c, d].

Then, if there exists a real number M (an upper bound) such that, for all [c, d] contained in I, $\int_c^d f(x)\,dx \le M$, then $\int_a^b f(x)\,dx$ converges, and what it converges to is less than, or equal to, M; in other words, if there is an upper bound M on all of the $\int_c^d f(x)\,dx$, then $\int_a^b f(x)\,dx$ converges to the least such upper bound.

In particular, if $\int_a^b f(x)\,dx$ diverges, what it diverges to is ∞.


Proof. See Theorem 


Remark 2.5.15. The analogous statement for $f(x) \le 0$ is true, and is obtained by applying Theorem 2.5.14 to −f(x), which would be non-negative.

What you find for non-positive f is: if there exists a real number M (a lower bound) such that, for all [c, d] contained in I, $M \le \int_c^d f(x)\,dx$, then $\int_a^b f(x)\,dx$ converges to the greatest such lower bound. In particular, if $\int_a^b f(x)\,dx$ diverges, what it diverges to is −∞.


Corollary 2.5.16. Let I be an interval of the form [a, b) or (a, b], where we allow the intervals [a, ∞) or (−∞, b]. Suppose that, for all x in I, $0 \le f(x) \le g(x)$, and that, for all closed intervals [c, d] contained in I, f and g are Riemann integrable on [c, d].

Then, if $\int_a^b g(x)\,dx$ converges, then so does $\int_a^b f(x)\,dx$, and what it converges to is something less than, or equal to, $\int_a^b g(x)\,dx$. This implies that, if $\int_a^b f(x)\,dx$ diverges, then so does $\int_a^b g(x)\,dx$.

Proof. This is easy now. Since $f \le g$ on I, Theorem 2.3.20 tells us that, for all [c, d] contained in I,

$$\int_c^d f(x)\,dx \le \int_c^d g(x)\,dx.$$

If $\int_a^b g(x)\,dx$ converges, then Theorem 2.5.14 tells us that it converges to the least upper bound M on the integrals $\int_c^d g(x)\,dx$, but, by the inequality above, this M is an upper bound on all of the integrals $\int_c^d f(x)\,dx$. The corollary now follows by applying Theorem 2.5.14 again.
Example 2.5.17. Consider the integrals

$$\int_0^1 \frac{1 + \sin x}{\sqrt{x}}\,dx \quad \text{and} \quad \int_1^\infty \frac{2 + \sin x}{x}\,dx.$$

Producing manageable anti-derivatives of these integrands is problematic/impossible. Nonetheless, we can use Corollary 2.5.16 to determine quickly whether they converge or not.

First, note that, for all x in (0, 1],

$$0 \le \frac{1 + \sin x}{\sqrt{x}} \le \frac{2}{\sqrt{x}},$$

and that, for all x in [1, ∞),

$$\frac{2 + \sin x}{x} \ge \frac{1}{x} \ge 0.$$

Now, we find

$$\int_0^1 \frac{2}{\sqrt{x}}\,dx = \lim_{a \to 0^+} \left(4x^{1/2}\Big|_a^1\right) = \lim_{a \to 0^+} \left(4 - 4a^{1/2}\right) = 4,$$

and

$$\int_1^\infty \frac{1}{x}\,dx = \lim_{b \to \infty} (\ln b - \ln 1) = \infty.$$

Combining all of the above with Corollary 2.5.16, we conclude that $\int_0^1 \frac{1 + \sin x}{\sqrt{x}}\,dx$ converges (to something less than or equal to 4, but we don't know what), while $\int_1^\infty \frac{2 + \sin x}{x}\,dx$ diverges to ∞.
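The two comparisons in Example 2.5.17 can be illustrated numerically. The sketch below uses a midpoint rule, with helper names and sample endpoints that are assumptions made for illustration, to approximate the integrals over closed sub-intervals. The first family should stay below the bound 4; the second should stay above ln b, which grows without bound. The approximations are rough and serve only to illustrate the inequalities.

```python
import math

def midpoint_integral(f, a, b, n=50_000):
    """Midpoint-rule approximation of the Riemann integral of f over [a, b]."""
    dx = (b - a) / n
    return sum(f(a + (i + 0.5) * dx) for i in range(n)) * dx

def first(x):
    return (1.0 + math.sin(x)) / math.sqrt(x)

def second(x):
    return (2.0 + math.sin(x)) / x

# Integrals over [a, 1] are bounded above by 4, the integral of 2/sqrt(x) over (0, 1].
for a in [1e-1, 1e-2, 1e-4]:
    print("a =", a, midpoint_integral(first, a, 1.0), "<= 4")

# Integrals over [1, b] are bounded below by ln b, the integral of 1/x over [1, b].
for b in [10.0, 100.0, 1000.0]:
    print("b =", b, midpoint_integral(second, 1.0, b), ">=", math.log(b))
```

A bounded, increasing family of values is what Theorem 2.5.14 turns into convergence; the unbounded lower bound ln b is what forces divergence in the second case.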


2.5.1 Exercises 


In Exercises 1 through 17, evaluate the given integral if it converges; otherwise, 


show that the integral diverges. 
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[ dx 
1. ; 
9 (20+5)? 
5 - dt 
Sj, Gre 
‘ i dx 
0 7x —_ 5 
ae 
1 (4y—3)1/? 
5. | sin z dz 
0 


2 du 
e [, (u+3)(u+ 5). 


10. 


14. 


TT 
d 
. | —“" | Hint: use a half-angle identity to evaluate the integral. 
9 l-—cosv 


1 

dx 
. Let n > 1. Does | — converge or diverge? If it converges, what does it converge to? 
© 7 


1 
dx : ‘ . 
. Let n = 1. Does i —, converge or diverge? If it converges, what does it converge to? 
(0) xv 


1 
dx , ‘ . 
. Let n < 1. Does i —, converge or diverge? If it converges, what does it converge to? 
0 ot 
Use Corollary 2.5.16 to determine whether the integrals in Exercises 18 through 24 converge or diverge. If they converge, you need not calculate the integrals.


1 p-y 
21. / © dy. 
0 y 
Co ey 
22. / — dy. 
1 VY 


co t = 
23. | es dx. Hint: See Exercise 14 above. 
9 wv +5x+1 


oo 3 

24. / - I ds. Hint: Justify and use the fact that e* > 1+ a for all s > 0. 
1 &— 

25. Prove that the elliptic integral $\int_0^1 \frac{dx}{\sqrt{(1 - x^2)(1 - k^2 x^2)}}$ converges. Assume $k^2 < 1$.


26. Formulate a similar comparison theorem for functions that grow without bound at some 
real number a. You may limit your statement to either a right or left-hand limit. 


CO 


1 
27. Find all real numbers p, if any exist, such that / = dx converges. 
x 


—co 


Co 


28. Find all real numbers p, if any exist, such that / dx converges. 


= xP 


Let $\Gamma(n) = \int_0^\infty e^{-x} x^{n-1}\,dx$. This function is called the gamma function and is ubiquitous in many branches of mathematics.

29. Calculate Γ(1) and Γ(2).

30. Prove inductively that Γ(n) = (n − 1)! for n = 1, 2, 3, ....

A random variable is said to be exponentially distributed if its density function is

$$f(x) = \begin{cases} \lambda e^{-\lambda x} & x \ge 0 \\ 0 & x < 0 \end{cases}$$

for some λ > 0.
31. What is E(X), the expected value of an exponential random variable?
32. What is $E(X^2)$, if X is an exponential random variable?


33. What is the variance, Var(X), of an exponential random variable? 


A data set (in statistics) or a random variable (in probability) is said to be normally distributed if its density function is $\frac{1}{\sigma\sqrt{2\pi}}\, e^{-(x-\mu)^2/2\sigma^2}$, where $x \in (-\infty, \infty)$. The graph of this density function forms the familiar bell curve.

34. Prove that the expected value of a normally distributed random variable is μ.
35. What is $E(X^2)$ for a normally distributed random variable?
36. Prove that the variance of a normally distributed random variable is $\sigma^2$.


37. Recall that the cumulative distribution function of a continuous random variable is $F(a) = \int_{-\infty}^{a} f(x)\,dx$. F(a) is the probability that the random variable X is less than or equal to a. What is F(a) for an exponentially distributed random variable?


38. Using the notation of the previous problem, prove that lim F(a) = 1 if F is the cumulative 
a—oo 


distribution function of an exponentially distributed random variable. 


39. The survival function of a random variable is the complement of the cumulative distribution function. That is, if F(a) is the cumulative distribution function, then S(a) = 1 − F(a) is the survival function. The name of this function can be understood from an actuarial perspective where the random variable is a life. Then S(a) is the probability that the life will survive longer than a years. What is S(a) for an exponential random variable?


40. The moment generating function of a distribution provides an alternative method of calculating the mean and variance of the distribution, and is defined as follows:

$$\phi(t) = E(e^{tX}) = \int_{-\infty}^{\infty} e^{tx} f(x)\,dx,$$

where $f(x)$ is the density function. Prove that if $X$ is exponentially distributed with parameter $\lambda$, then $\phi(t) = \frac{\lambda}{\lambda - t}$ for $t < \lambda$.


41. Show that $\phi'(0) = E(X)$ if $X$ is exponentially distributed.


42. Show that $\phi''(0) = E(X^2)$ if $X$ is exponentially distributed. These two results are not specific to the exponential distribution; they hold for arbitrary continuous distributions. (Bonus exercise: try to prove this!)


43. The amount of time it takes to wait in line to register for classes is exponentially distributed with mean 12 minutes (so $\lambda = 1/12$).

a. What’s the probability a student will have to wait for less than 12 minutes?
b. What’s the probability a student’s waiting time will be between 10 and 20 minutes?
c. What’s the probability a student’s waiting time exceeds 25 minutes?


44. Recall that the mass $m(t)$ of a decaying compound is given by $m(t) = m_0 e^{-\lambda t}$, where $t \geq 0$. However, this apparently exact formula is merely a model for a complicated physical process. Radioactive decay can be viewed in probabilistic terms using an exponential distribution. First, note that $e^{-\lambda t}$ is the survival function of an exponential random variable. Assuming the life of the atom is exponentially distributed, it makes sense to define the mean life of an atom as $ML = \int_0^\infty t\lambda e^{-\lambda t}\,dt$. Calculate the mean lifetime of the following atoms ($\lambda$ is given so that $t$ is in years).

a. uranium-239, $\lambda = 1.55 \times 10^{-10}$.
b. carbon-14, $\lambda = 1.15 \times 10^{-4}$.
c. tritium, $\lambda = 0.056$.


45. Suppose we have 100 tritium atoms. How many atoms are anticipated to decay between 8 and 12 years based on the model in the previous problem? Note that this is the same as asking for the probability that a single tritium atom will decay between 8 and 12 years from now.


46. A random variable is gamma distributed if its density function is given by

$$f(x) = \frac{\lambda e^{-\lambda x}(\lambda x)^{p-1}}{\Gamma(p)},$$

where $\lambda > 0$, $p > 0$ and $x > 0$. Show that $\int_0^\infty f(x)\,dx = 1$. Hint: this is easier than it sounds; it’s sufficient to show the numerator is equal to the denominator, and the denominator is a constant when the function is integrated with respect to $x$.


47. The Fresnel integrals arise in a variety of applications, from optics to the design of railroad
tracks. Let $F(a,b) = \int_{\sqrt{a}}^{\sqrt{b}} \sin(x^2)\,dx$.

a. Make a substitution and show that $F(a,b) = \frac{1}{2}\int_a^b \frac{\sin y}{\sqrt{y}}\,dy$.

b. Integrate this expression by parts to show

$$\int_a^b \frac{\sin y}{\sqrt{y}}\,dy = \frac{\cos a}{\sqrt{a}} - \frac{\cos b}{\sqrt{b}} - \frac{1}{2}\int_a^b \frac{\cos y}{y^{3/2}}\,dy.$$

c. Show that given $\epsilon > 0$, there exists $N$ such that if $a > N$ and $b > N$, then $|F(a,b)| < \epsilon$.

d. Use the Cauchy criterion and the fact that $\int_a^b = \int_0^b - \int_0^a$ to conclude that $\lim_{b\to\infty} F(a,b)$ converges. This shows that $\int_0^\infty \sin(x^2)\,dx$ converges.


48. Mimic the argument in the previous problem to show that $\int_0^\infty \cos(x^2)\,dx$ converges.


49. Is $\int_0^\infty x\sin(x^4)\,dx$ convergent? Justify your answer.


50. We showed that the improper integral $\int_0^C \frac{\sin x}{x}\,dx$ is convergent. This exercise shows that this result can be extended when $C$ approaches infinity, that is, $\int_0^\infty \frac{\sin x}{x}\,dx$ is convergent.

a. Assume $0 < a < b$ and let $F(a,b) = \int_a^b \frac{\sin x}{x}\,dx$. Show that $F(a,b) = F(a,a+\pi) - F(b,b+\pi) + F(a+\pi,b+\pi)$.

b. Make a change of variables and show that $F(a+\pi,b+\pi) = -\int_a^b \frac{\sin y}{y+\pi}\,dy$.

c. Show that $2F(a,b) = F(a,a+\pi) - F(b,b+\pi) + \pi\int_a^b \frac{\sin x}{x(x+\pi)}\,dx$. Hint: Keep in mind that $x$ and $y$ are just dummy variables of integration. We may as well use just one of them.

d. Prove that $|F(a,a+\pi)|$ and $|F(b,b+\pi)|$ are both bounded above by $\pi/a$.

e. Show by the comparison results above that $\pi\left|\int_a^b \frac{\sin x}{x(x+\pi)}\,dx\right| \leq \pi\int_a^b \frac{dx}{x^2}$.

f. Combine parts (d) and (e) to show that $2|F(a,b)| \leq \frac{2\pi}{a} + \pi\int_a^b \frac{dx}{x^2}$.

g. Show that given $\epsilon > 0$, there exists $N$ such that $a > N$ and $b > N$ implies $|F(a,b)| < \epsilon$.

h. Use the Cauchy criterion to conclude that $\int_0^\infty \frac{\sin x}{x}\,dx$ converges.


If $f(t)$ is a continuous function on $[0,\infty)$, then the Laplace transform of $f$ is the function $F(s) = \int_0^\infty f(t)e^{-st}\,dt$. Laplace transforms provide a powerful technique for solving differential equations. Calculate the Laplace transforms of the functions in Exercises 51 through 56. Note that $s$ values for which the Laplace transform integral does not exist are not in the domain of the transformed function $F(s)$.


51. $f(t) = 1$.

52. $f(t) = kt$, where $k$ is a constant.

53. $f(t) = e^{at}$, where $a$ is a constant.

54. $f(t) = \sin(ct)$, where $c$ is a constant.

55. $f(t) = \cosh t$.

56. $f(t) = \sinh t$.


57. Suppose $p > -1$. Recall the definition of the gamma function from above. Show that the Laplace transform of $t^p$ is $F(s) = \Gamma(p+1)/s^{p+1}$. In particular, if $p$ is a positive integer, then $F(s) = p!/s^{p+1}$.


2.6 Numerical Techniques for Approximating Integrals


In this section, it may seem like we’re backing up. In an earlier section, we looked at Riemann sums; then, in Section 2.3, we took limits of Riemann sums to define the definite integral, the continuous sum of infinitesimal contributions. Then, after seeing how tedious it is to calculate integrals as limits of Riemann sums, in Section 2.4, we presented the Fundamental Theorem of Calculus, which tells us that, if we have an anti-derivative $F$ of a continuous function $f$, then it’s easy to calculate the values of definite integrals of $f$, in terms of $F$. Great. So, what’s our problem?


Our problem is that there are continuous functions $f$ for which we cannot produce manageable anti-derivatives, and so we cannot use the Fundamental Theorem to calculate definite integrals for such $f$. The classic example is $f(x) = e^{-x^2}$. Definite integrals of this function are of fundamental importance in probability and statistics, and yet, as we mentioned in Remark 1.1.23, $e^{-x^2}$ has no elementary anti-derivative.

Thus, while the first part of the Fundamental Theorem, Theorem 2.4.7, tells us that definite integrals of continuous functions on closed intervals exist, our question, for functions $f$ without “nice” anti-derivatives, is: can we approximate $\int_a^b f(x)\,dx$ in a better way than by simply using Riemann sums?




The answer to this question is: “yes”. We will look at two “rules” for approximating the 
values of definite integrals: the Trapezoidal (or Trapezoid) Rule and Simpson’s Rule. Both of 
these rules use summations that look similar to Riemann sums, but the summations are, in fact, 


not Riemann sums. 


The Trapezoidal Rule is very easy to derive. We approximate the graph of f, the function 
we want to integrate, by using line segments between points on the graph, and (assuming for the 
convenience of this discussion that f > 0) we then find the area of the region, a trapezoid, below 
each line segment and above the relevant interval. Adding these areas gives an approximation 
of the corresponding definite integral. 


It turns out that, for many functions, the approximation of the definite integral using the Trapezoidal Rule is worse than the approximation using midpoint Riemann sums (Riemann sums in which all subintervals have the same length, and the sample points are the midpoints of the subintervals; see Example 2.2.8). Hence, in some sense, the Trapezoidal Rule is useless.


However, Simpson’s Rule is just as easy to use as the Trapezoidal Rule, and yet, approximating an integral using Simpson’s Rule is usually stunningly more accurate than using the Trapezoidal Rule or using midpoint Riemann sums. Simpson’s Rule is based on approximating the graph of $f$ with a collection of parabola segments, instead of with the line segments used in the Trapezoidal Rule. Since graphs of typical functions curve, and parabolas curve, it should seem reasonable that Simpson’s Rule is typically more accurate than the Trapezoidal Rule.


Below, we first adopt some notation and remind you what a midpoint Riemann sum is. After that, we discuss and derive the Trapezoidal Rule, and then discuss and state Simpson’s Rule, leaving the actual derivation of Simpson’s Rule for the Technical Matters section, Section 2.A. We also give statements regarding bounds on the errors when you use the three approximations; the proofs of these bounds are also in Section 2.A. We first give examples in which we do not consider the error bounds. Finally, we give an extended example where we discuss the error, and bounds on the error, in more detail.


Suppose that we have a function $f$ on a closed interval $[a,b]$.


We subdivide the interval $[a,b]$ into $n$ subintervals of equal length $\Delta x = (b-a)/n$. For $0 \leq k \leq n$, the corresponding partition of $[a,b]$, i.e., the collection of endpoints of the subintervals, is given by $x_k = a + k\Delta x$, that is,

$$x_0 = a, \quad x_1 = a + \Delta x, \quad x_2 = a + 2\Delta x, \quad \ldots, \quad x_{n-1} = a + (n-1)\Delta x, \quad x_n = a + n\Delta x = b.$$

For $1 \leq k \leq n$, the $k$-th subinterval is the closed interval $[x_{k-1}, x_k]$; its midpoint is $s_k = (x_{k-1} + x_k)/2$.
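The notation above can be made concrete with a few lines of Python. This is only an illustrative sketch: the function name `partition` and the choice of the interval $[0,1]$ with $n = 4$ are assumptions made for the example, not part of the text.

```python
def partition(a, b, n):
    """Return (dx, endpoints, midpoints) for n equal subintervals of [a, b]."""
    dx = (b - a) / n
    # Endpoints x_0, ..., x_n, where x_k = a + k*dx.
    xs = [a + k * dx for k in range(n + 1)]
    # Midpoint of the k-th subinterval [x_{k-1}, x_k], for k = 1, ..., n.
    mids = [(xs[k - 1] + xs[k]) / 2 for k in range(1, n + 1)]
    return dx, xs, mids


dx, xs, mids = partition(0.0, 1.0, 4)
print("dx        =", dx)
print("endpoints =", xs)
print("midpoints =", mids)
```

For $[0,1]$ with $n = 4$, the endpoints are $0, 1/4, 1/2, 3/4, 1$ and the midpoints are $1/8, 3/8, 5/8, 7/8$.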


If $A$ is an approximation of $\int_a^b f(x)\,dx$, then we refer to the difference $E = \int_a^b f(x)\,dx - A$ as the error in the approximation. As we usually care about whether $A$ is within plus or minus some amount of the actual value of the integral, it is usually the absolute value of $E$ that we are interested in. We refer to the absolute value of the error as the absolute error.


We should remark that we will give all our calculations below to 12 decimal places. This may 
seem like ridiculous precision; however, when you’re trying to compare various approximations to 
each other, you want to go out to enough decimal places to actually see where the approximations 
differ. 


We first use midpoint Riemann sums to approximate the definite integral. Graphically, we view the midpoint Riemann sum (for $f > 0$) as shown in Figure 2.37.


Figure 2.37: A midpoint Riemann sum.


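Definition 2.6.1. (Midpoint Approximation for Integrals) Using the notation above, the midpoint approximation of $\int_a^b f(x)\,dx$, using $n$ subintervals, is

$$\int_a^b f(x)\,dx \approx \Delta x\left[f\left(\frac{x_0+x_1}{2}\right) + f\left(\frac{x_1+x_2}{2}\right) + \cdots + f\left(\frac{x_{n-1}+x_n}{2}\right)\right].$$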


Example 2.6.2. Consider the integral $\int_0^1 e^{-x^2}\,dx$. A calculator will tell you, correctly, that, to 12 decimal places, the value of this definite integral is 0.746824132812. What approximations do we obtain from the Midpoint Approximation using $n = 2$ and $n = 4$?


When $n = 2$, we have $\Delta x = (1-0)/2 = 1/2$, $x_0 = 0$, $x_1 = 1/2$, and $x_2 = 1$ and, of course, $f(x) = e^{-x^2}$.

The Midpoint Approximation is

$$\int_0^1 e^{-x^2}\,dx \approx \frac{1}{2}\left[e^{-(1/4)^2} + e^{-(3/4)^2}\right],$$

which, to 12 decimal places, is 0.754597943772. Thus, the absolute error here is

$$|0.746824132812 - 0.754597943772| = 0.007773810960;$$

not great, but pretty good, considering that we used only 2 subintervals.


Let’s try $n = 4$. Now, $\Delta x = (1-0)/4 = 1/4$, $x_0 = 0$, $x_1 = 1/4$, $x_2 = 1/2$, $x_3 = 3/4$, and $x_4 = 1$.


Our new Midpoint Approximation is

$$\int_0^1 e^{-x^2}\,dx \approx \frac{1}{4}\left[e^{-(1/8)^2} + e^{-(3/8)^2} + e^{-(5/8)^2} + e^{-(7/8)^2}\right],$$

which, to 12 decimal places, is 0.748747131891. Thus, the absolute error here is

$$|0.746824132812 - 0.748747131891| = 0.001922999079;$$

this is roughly 1/4 of the absolute error that we had when we used $n = 2$.
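The two midpoint calculations above can be reproduced with a short Python sketch. The function name `midpoint_rule` is an assumption chosen for this illustration; the printed values should agree with those quoted in Example 2.6.2, up to floating-point rounding.

```python
import math


def midpoint_rule(f, a, b, n):
    """Midpoint approximation of the integral of f over [a, b], n equal subintervals."""
    dx = (b - a) / n
    # The midpoint of the k-th subinterval is a + (k - 1/2)*dx, for k = 1, ..., n.
    return dx * sum(f(a + (k - 0.5) * dx) for k in range(1, n + 1))


def f(x):
    return math.exp(-x * x)


for n in (2, 4):
    print(f"n = {n}: {midpoint_rule(f, 0.0, 1.0, n):.12f}")
```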


Now let’s look at the Trapezoidal Rule. As we stated at the beginning of the section, the Trapezoidal Rule does not involve Riemann sums, but rather something that looks similar to Riemann sums. The idea of the Trapezoidal Rule is simple; over each of our subintervals $[x_{k-1}, x_k]$, you approximate the function $f$ by the unique linear function $L_k(x) = m_k x + b_k$ whose graph passes through the points $(x_{k-1}, f(x_{k-1}))$ and $(x_k, f(x_k))$, i.e., on the graph of $f$, you “connect the dots” (with line segments) between the points of the graph of $f$ which correspond to the ends of the subinterval.


Figure 2.38: Typical areas involved in the Trapezoidal Rule.


At this point, instead of integrating $f$, you calculate each of the integrals $I_k$ of the linear functions $L_k(x) = m_k x + b_k$ over the interval $[x_{k-1}, x_k]$, and then you add these integrals together. What you obtain for $I_k$ is

$$I_k = \frac{\Delta x}{2}\left[f(x_{k-1}) + f(x_k)\right].$$

If $f$ is positive, you should recognize this as the area of a trapezoid: one half the “height” (here, it’s the width) times the sum of the lengths of the bases. Adding these together, we find the approximation

$$\int_a^b f(x)\,dx \approx I_1 + I_2 + I_3 + \cdots + I_{n-1} + I_n =$$

$$\frac{\Delta x}{2}\left[f(x_0) + f(x_1)\right] + \frac{\Delta x}{2}\left[f(x_1) + f(x_2)\right] + \frac{\Delta x}{2}\left[f(x_2) + f(x_3)\right] + \cdots + \frac{\Delta x}{2}\left[f(x_{n-2}) + f(x_{n-1})\right] + \frac{\Delta x}{2}\left[f(x_{n-1}) + f(x_n)\right].$$

Factoring out the $\Delta x/2$ and combining the pairs of overlapping terms everywhere, except for at the $f(x_0)$ and $f(x_n)$ terms, we obtain the following approximation.


Definition 2.6.3. (Trapezoidal Rule) Using the notation above, the Trapezoidal Rule Approximation of $\int_a^b f(x)\,dx$, using $n$ subintervals, is

$$\int_a^b f(x)\,dx \approx \frac{\Delta x}{2}\left[f(x_0) + 2f(x_1) + 2f(x_2) + \cdots + 2f(x_{n-1}) + f(x_n)\right].$$


Note that the pattern of the coefficients for $f(x_k)$ in the Trapezoidal Rule is that the first and last coefficients are 1’s and, aside from that, the coefficients are always 2’s.


Example 2.6.4. As in Example 2.6.2, consider the integral $\int_0^1 e^{-x^2}\,dx$. Recall that, to 12 decimal places, the value of this definite integral is 0.746824132812. What approximations do we obtain from the Trapezoidal Rule using $n = 2$ and $n = 4$?


When $n = 2$, as before, we have $\Delta x = (1-0)/2 = 1/2$, $x_0 = 0$, $x_1 = 1/2$, and $x_2 = 1$.

The Trapezoidal Rule Approximation is

$$\int_0^1 e^{-x^2}\,dx \approx \frac{1}{4}\left[e^{-0^2} + 2e^{-(1/2)^2} + e^{-1^2}\right],$$

which, to 12 decimal places, is 0.731370251829. Thus, the absolute error here is

$$|0.746824132812 - 0.731370251829| = 0.015453880983.$$

Note that this is roughly double the absolute error that we had when we used the midpoint Riemann sum with $n = 2$.


What about when $n = 4$? As before, $\Delta x = (1-0)/4 = 1/4$, $x_0 = 0$, $x_1 = 1/4$, $x_2 = 1/2$, $x_3 = 3/4$, and $x_4 = 1$.


Our new Trapezoidal Rule Approximation is

$$\int_0^1 e^{-x^2}\,dx \approx \frac{1}{8}\left[e^{-0^2} + 2e^{-(1/4)^2} + 2e^{-(1/2)^2} + 2e^{-(3/4)^2} + e^{-1^2}\right],$$

which, to 12 decimal places, is 0.742984097800. Thus, the absolute error here is

$$|0.746824132812 - 0.742984097800| = 0.003840035012.$$

As with the Midpoint Approximation, this new approximation, using the Trapezoidal Rule with $n = 4$, has roughly 1/4 of the absolute error that we had when we used the Trapezoidal Rule with $n = 2$. However, you should also notice that it is still true that the absolute error, with $n = 4$, using the Trapezoidal Rule, is roughly double the error, with $n = 4$, using the midpoint Riemann sum.
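As a check on the arithmetic, here is a minimal Python sketch of the Trapezoidal Rule as stated in Definition 2.6.3. The function name `trapezoidal_rule` is an assumption for this illustration; the output should match the values in Example 2.6.4, up to floating-point rounding.

```python
import math


def trapezoidal_rule(f, a, b, n):
    """Trapezoidal Rule approximation of the integral of f over [a, b]."""
    dx = (b - a) / n
    # Interior endpoints x_1, ..., x_{n-1} each receive coefficient 2.
    interior = sum(f(a + k * dx) for k in range(1, n))
    return dx / 2 * (f(a) + 2 * interior + f(b))


def f(x):
    return math.exp(-x * x)


for n in (2, 4):
    print(f"n = {n}: {trapezoidal_rule(f, 0.0, 1.0, n):.12f}")
```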


What is Simpson’s Rule? In a sense, Simpson’s Rule is the next step after the Trapezoidal Rule. The Trapezoidal Rule takes pairs of successive points on the graph of $f$, as determined by the endpoints of the subintervals. Two points determine a line, and so determine a linear function $L(x) = mx + b$. On the $k$-th subinterval, we approximate $f$ by the linear function $L_k(x) = m_k x + b_k$ and integrate this linear function over the interval $[x_{k-1}, x_k]$, instead of integrating $f$. Then, we add together the approximations over all of the subintervals.


Simpson’s Rule uses the fact that three points determine a parabola (or a line, if the points are collinear), and so determine a unique polynomial $q = q(x) = ax^2 + bx + c$ of degree less than or equal to 2. Simpson’s Rule takes three successive points on the graph of $f$, as determined by the endpoints of the subintervals, and approximates $f$ by the corresponding polynomial $q = q(x) = ax^2 + bx + c$ over the two subintervals whose endpoints correspond to the three points we took on the graph. For instance, over the two successive subintervals $[x_0, x_1]$ and $[x_1, x_2]$, i.e., over the interval $[x_0, x_2]$, Simpson’s Rule approximates $f$ by the unique function $q(x) = ax^2 + bx + c$ whose graph passes through the three points $(x_0, f(x_0))$, $(x_1, f(x_1))$, and $(x_2, f(x_2))$.

When you determine the function $q(x)$ and integrate it over $[x_0, x_2]$, what you find is relatively simple. You get $\frac{\Delta x}{3}\left[f(x_0) + 4f(x_1) + f(x_2)\right]$. (See Section 2.A.) We can do this for pairs of subintervals, and approximate the entire integral $\int_a^b f(x)\,dx$, provided that $n$, the number of subintervals, is even.


Figure 2.39: Typical areas involved in Simpson’s Rule. 


Note that we had to make Figure 2.39 unusually large for you to have any hope of seeing that the dotted parabolas do not exactly fit the graph.


Assuming that $n$ is even, if we add together the contributions over pairs of intervals, the approximation that we obtain is

$$\int_a^b f(x)\,dx \approx \frac{\Delta x}{3}\left[f(x_0) + 4f(x_1) + f(x_2)\right] + \frac{\Delta x}{3}\left[f(x_2) + 4f(x_3) + f(x_4)\right] + \cdots$$

$$+ \frac{\Delta x}{3}\left[f(x_{n-4}) + 4f(x_{n-3}) + f(x_{n-2})\right] + \frac{\Delta x}{3}\left[f(x_{n-2}) + 4f(x_{n-1}) + f(x_n)\right].$$

Factoring out the $\Delta x/3$ and combining the pairs of overlapping terms everywhere, except for at the $f(x_0)$ and $f(x_n)$ terms, we obtain the following approximation.


Definition 2.6.5. (Simpson’s Rule) Using the notation above, the Simpson’s Rule approximation of $\int_a^b f(x)\,dx$, using $n$ subintervals, where $n$ is even, is

$$\int_a^b f(x)\,dx \approx \frac{\Delta x}{3}\left[f(x_0) + 4f(x_1) + 2f(x_2) + 4f(x_3) + 2f(x_4) + \cdots + 4f(x_{n-1}) + f(x_n)\right].$$


Note that the pattern of the coefficients for $f(x_k)$ in Simpson’s Rule is that the first and last coefficients are 1’s and, aside from that, the coefficients alternate 4, 2, 4, 2, etc. The next-to-last coefficient will always be a 4, due to the fact that $n$ is even.


Example 2.6.6. As in our previous two examples, consider the integral $\int_0^1 e^{-x^2}\,dx$. Recall that, to 12 decimal places, the value of this definite integral is 0.746824132812. What approximations do we obtain from Simpson’s Rule using $n = 2$ and $n = 4$?

When $n = 2$, we still have $\Delta x = (1-0)/2 = 1/2$, $x_0 = 0$, $x_1 = 1/2$, and $x_2 = 1$.


The Simpson’s Rule Approximation is

$$\int_0^1 e^{-x^2}\,dx \approx \frac{1}{6}\left[e^{-0^2} + 4e^{-(1/2)^2} + e^{-1^2}\right],$$

which, to 12 decimal places, is 0.747180428910. Thus, the absolute error here is

$$|0.746824132812 - 0.747180428910| = 0.000356296098.$$

Wow! This absolute error is roughly 1/20 of the absolute error using $n = 2$ for the midpoint Riemann sum, and is even roughly 1/5 of the absolute error using $n = 4$ for the midpoint Riemann sum (and, remember, the midpoint Riemann sums gave better approximations than the Trapezoidal Rule).


Thus, what we see is that the approximation via Simpson’s Rule, even with a smaller $n$ value, is significantly more accurate than those obtained from midpoint Riemann sums or the Trapezoidal Rule. This is typical.


Let’s see what happens when $n = 4$. As before, $\Delta x = (1-0)/4 = 1/4$, $x_0 = 0$, $x_1 = 1/4$, $x_2 = 1/2$, $x_3 = 3/4$, and $x_4 = 1$.


Our new Simpson’s Rule Approximation is

$$\int_0^1 e^{-x^2}\,dx \approx \frac{1}{12}\left[e^{-0^2} + 4e^{-(1/4)^2} + 2e^{-(1/2)^2} + 4e^{-(3/4)^2} + e^{-1^2}\right],$$

which, to 12 decimal places, is 0.746855379791. Thus, the absolute error here is

$$|0.746824132812 - 0.746855379791| = 0.000031246979.$$

This new approximation, using Simpson’s Rule with $n = 4$, has roughly 1/10 of the absolute error that we had when we used Simpson’s Rule with $n = 2$.
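Simpson’s Rule, as stated in Definition 2.6.5, can be sketched in Python as follows. The function name `simpsons_rule` and the choice to raise an error for odd $n$ are assumptions of this illustration; the output should match the values in Example 2.6.6, up to floating-point rounding.

```python
import math


def simpsons_rule(f, a, b, n):
    """Simpson's Rule approximation of the integral of f over [a, b]; n must be even."""
    if n % 2 != 0:
        raise ValueError("Simpson's Rule needs an even number of subintervals")
    dx = (b - a) / n
    total = f(a) + f(b)  # the first and last coefficients are 1
    for k in range(1, n):
        # Interior coefficients alternate 4, 2, 4, 2, ..., ending with 4.
        total += (4 if k % 2 == 1 else 2) * f(a + k * dx)
    return dx / 3 * total


def f(x):
    return math.exp(-x * x)


for n in (2, 4):
    print(f"n = {n}: {simpsons_rule(f, 0.0, 1.0, n):.12f}")
```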


It is reasonable to ask if it is possible to make precise in what sense midpoint Riemann sums, the Trapezoidal Rule, and Simpson’s Rule are “reasonable” approximation methods. Basically, the question is: what can an instructor say to a student who wants credit for claiming “$\int_0^1 e^{-x^2}\,dx$ is approximately 1,000,000, it’s just that the error is really large”?


A possible answer is provided by the following three theorems, which we prove in the Technical Matters section, Section 2.A. These theorems give upper bounds on the absolute error when approximating an integral by using midpoint Riemann sums, the Trapezoidal Rule, and/or Simpson’s Rule, provided that the integrand possesses enough derivatives. In particular, these theorems imply that, for all three approximation techniques, the error approaches zero as $n$ approaches $\infty$.

Theorem 2.6.7. Suppose that $a < b$ and the midpoint Riemann sum with $n$ subintervals of equal length $\Delta x$ is used to approximate $\int_a^b f(x)\,dx$.


If $f''(x)$ exists for all $x$ in some open interval containing the closed interval $[a,b]$ and, if there exists a number $M \geq 0$ such that, for all $x$ in $[a,b]$, $|f''(x)| \leq M$, then the absolute value of the error, $E_{\text{midpt}}$, satisfies the inequality

$$|E_{\text{midpt}}| \leq \frac{M(\Delta x)^3 n}{24} = \frac{M(b-a)^3}{24n^2}.$$


Proof. See Theorem 2.A.15.


If, in fact, $f''$ above is continuous on the interval $[a,b]$, then the best, i.e., smallest, value that you can use for $M$ above is the maximum value of $|f''|$ on $[a,b]$.


Theorem 2.6.8. Suppose that $a < b$ and the Trapezoidal Rule with $n$ subintervals of equal length $\Delta x$ is used to approximate $\int_a^b f(x)\,dx$.

If $f''(x)$ exists for all $x$ in some open interval containing the closed interval $[a,b]$ and, if there exists a number $M \geq 0$ such that, for all $x$ in $[a,b]$, $|f''(x)| \leq M$, then the absolute value of the error, $E_{\text{trap}}$, satisfies the inequality

$$|E_{\text{trap}}| \leq \frac{M(\Delta x)^3 n}{12} = \frac{M(b-a)^3}{12n^2}.$$


Proof. See Section 2.A.


Remark 2.6.9. Note that our upper bound for the absolute value of the error in approximating 
with the Trapezoidal Rule is precisely twice the upper bound that we had for approximating 
with midpoint Riemann sums. This is why references frequently say that midpoint Riemann 
sums generally provide a better approximation to definite integrals than does the Trapezoidal 
Rule. Understand, however, that the general upper bounds on the absolute values of the errors 
that appear in the theorems don’t have to be very close to the true absolute values of the errors; 


they are simply gross upper bounds. 




In particular, the absolute value of the error using the Trapezoidal Rule can be strictly 
smaller than what you obtain from midpoint Riemann sums. That’s just not what we expect, 


in general. 


Theorem 2.6.10. Suppose that $a < b$, $n$ is even, and Simpson’s Rule with $n$ subintervals of equal length $\Delta x$ is used to approximate $\int_a^b f(x)\,dx$.


If $f^{(4)}(x)$ exists for all $x$ in some open interval containing the closed interval $[a,b]$ and, if there exists a number $M \geq 0$ such that, for all $x$ in $[a,b]$, $|f^{(4)}(x)| \leq M$, then the absolute value of the error, $E_{\text{simp}}$, satisfies the inequality

$$|E_{\text{simp}}| \leq \frac{M(\Delta x)^5 n}{180} = \frac{M(b-a)^5}{180n^4}.$$


Proof. See Theorem 2.A.17.
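The powers of $n$ in these bounds ($n^2$ for the midpoint and trapezoidal approximations, $n^4$ for Simpson’s Rule) suggest that doubling $n$ should cut the first two errors by a factor of about 4 and the Simpson error by a factor of about 16. The following Python sketch checks this numerically for $\int_0^1 e^{-x^2}\,dx$. The function names are assumptions for this illustration, and the reference value is computed from the standard identity $\int_0^1 e^{-x^2}\,dx = \frac{\sqrt{\pi}}{2}\,\mathrm{erf}(1)$, which is not used in the text.

```python
import math


def midpoint(f, a, b, n):
    dx = (b - a) / n
    return dx * sum(f(a + (k - 0.5) * dx) for k in range(1, n + 1))


def trapezoid(f, a, b, n):
    dx = (b - a) / n
    return dx / 2 * (f(a) + 2 * sum(f(a + k * dx) for k in range(1, n)) + f(b))


def simpson(f, a, b, n):
    dx = (b - a) / n  # n is assumed to be even
    weighted = sum((4 if k % 2 else 2) * f(a + k * dx) for k in range(1, n))
    return dx / 3 * (f(a) + weighted + f(b))


def f(x):
    return math.exp(-x * x)


# Reference value via the error function (an assumption of this sketch).
exact = math.sqrt(math.pi) / 2 * math.erf(1.0)

errors = {}
for n in (4, 8, 16, 32):
    errors[n] = tuple(abs(exact - rule(f, 0.0, 1.0, n))
                      for rule in (midpoint, trapezoid, simpson))
    print(n, ["%.3e" % e for e in errors[n]])

# Ratio of successive errors when n doubles from 8 to 16.
print([errors[8][i] / errors[16][i] for i in range(3)])
```

The observed ratios are only approximate; the theorems give upper bounds, not exact error formulas.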


Remark 2.6.11. If f(x) is a quadratic polynomial, then it should not be surprising that 
Simpson’s Rule would give the exact value of the integral; after all, Simpson’s Rule is obtained 


by approximating the given function with quadratic functions. 


What is somewhat surprising is that, even if $f(x)$ is a cubic polynomial, the error bound above tells us that Simpson’s Rule still yields the exact value of the integral, for the fourth derivative of a cubic polynomial is 0, and so the $M$ in the error statement could be taken to be 0.
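The exactness for cubics can be checked with exact rational arithmetic. In the Python sketch below, the sample polynomials $x^3 - 2x + 1$ and $x^4$ on $[0,2]$ are assumptions chosen for illustration; their exact integrals are $2$ and $32/5$, respectively.

```python
from fractions import Fraction


def simpsons_rule(f, a, b, n):
    """Simpson's Rule with n (even) subintervals; works with Fraction inputs."""
    dx = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 == 1 else 2) * f(a + k * dx)
    return dx / 3 * total


def cubic(x):
    return x**3 - 2 * x + 1


def quartic(x):
    return x**4


a, b = Fraction(0), Fraction(2)
print("cubic:  ", simpsons_rule(cubic, a, b, 2), "(exact integral: 2)")
print("quartic:", simpsons_rule(quartic, a, b, 2), "(exact integral: 32/5)")
```

With $n = 2$, the cubic is integrated exactly, while the quartic is not; its error, $|32/5 - 20/3| = 4/15$, equals the bound of Theorem 2.6.10 with $M = 24$.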


Let’s look at an example for which we can produce an easy anti-derivative, and can easily find bounds on the second and fourth derivatives, so that we can see how the different approximations compare with the exact answer. Understand, however, that the true value of these approximations is in cases like $\int_0^1 e^{-x^2}\,dx$, where we can’t produce a convenient anti-derivative.

Example 2.6.12. Let’s approximate $\int_1^2 \frac{1}{x^2}\,dx$ using midpoint Riemann sums, the Trapezoidal Rule, and Simpson’s Rule with both $n = 2$ and $n = 4$. First, though, let’s go ahead and calculate the theoretical error bounds, so that we can see how our actual errors compare with the bounds. We will also calculate how big we would need to pick $n$ to have an error bound in each of the three approximations that would guarantee an error of less than 0.00001.


So, we have $f(x) = 1/x^2 = x^{-2}$, $a = 1$, and $b = 2$. We calculate the derivatives that we need for the error bounds: $f'(x) = -2x^{-3}$, $f''(x) = 6x^{-4}$, $f'''(x) = -24x^{-5}$, and $f^{(4)}(x) = 120x^{-6}$. Thus, the maximum value of $|f''(x)|$ on the interval $[1,2]$ is 6, and the maximum value of $|f^{(4)}(x)|$ on $[1,2]$ is 120.


Therefore, when $n = 2$, we know that

$$|E_{\text{midpt}}| \leq \frac{6(2-1)^3}{24 \cdot 2^2} = \frac{1}{16} = 0.0625;$$

$$|E_{\text{trap}}| \leq \frac{6(2-1)^3}{12 \cdot 2^2} = \frac{1}{8} = 0.125;$$

$$|E_{\text{simp}}| \leq \frac{120(2-1)^5}{180 \cdot 2^4} = \frac{1}{24} = 0.041\overline{6}.$$

When $n = 4$, we find

$$|E_{\text{midpt}}| \leq \frac{6(2-1)^3}{24 \cdot 4^2} = \frac{1}{64} = 0.015625;$$

$$|E_{\text{trap}}| \leq \frac{6(2-1)^3}{12 \cdot 4^2} = \frac{1}{32} = 0.03125;$$

$$|E_{\text{simp}}| \leq \frac{120(2-1)^5}{180 \cdot 4^4} = \frac{1}{384} = 0.0026041\overline{6}.$$

You need to realize that these numbers are bounds on the possible amounts of error, not
the actual errors. In fact, there’s a very important point here; if we know precisely what 
the error in using an approximation for an integral is, then we know the actual value of the 
integral; take the approximation and add/subtract the error. Of course, in this example, 


we do know the actual value of the integral, but — we’ll write it again — the value of these 
approximation techniques is that we can use them in cases where we do not have a way to 
calculate the exact integral. Thus, the most that we can hope for, typically, is to calculate 
a nice bound on the error, not to calculate the error itself. 


How many subintervals would we need to use with each of the approximations to produce bounds on the absolute values of the errors that would guarantee our approximation is within $\pm 0.00001$?


With midpoint Riemann sums, we would need

$$|E_{\text{midpt}}| \leq \frac{6(2-1)^3}{24 \cdot n^2} \leq \frac{1}{100{,}000},$$

that is, we must require $4n^2 \geq 100{,}000$. Thus, we would need $n \geq \sqrt{25{,}000} \approx 158.11$, i.e., the smallest $n$ that makes the upper bound on the absolute value of the error less than or equal to 0.00001 is $n = 159$.


It is possible that a smaller $n$ actually yields an approximation within the desired amount; we just can’t determine such an $n$ from our theoretical bound.


With the Trapezoidal Rule, we would need

$$|E_{\text{trap}}| \leq \frac{6(2-1)^3}{12 \cdot n^2} \leq \frac{1}{100{,}000},$$

that is, we must require $2n^2 \geq 100{,}000$. Thus, we would need $n \geq \sqrt{50{,}000} \approx 223.61$, i.e., the smallest $n$ that makes the upper bound on the absolute value of the error less than or equal to 0.00001 is $n = 224$.


Again, it is possible that a smaller $n$ actually yields an approximation within the desired amount; we just can’t determine such an $n$ from our theoretical bound.


With Simpson’s Rule, we would need

$$|E_{\text{simp}}| \leq \frac{120(2-1)^5}{180 \cdot n^4} \leq \frac{1}{100{,}000},$$

that is, we must require $3n^4/2 \geq 100{,}000$. Thus, we would need $n$ to be at least approximately 16.07, i.e., the smallest $n$ that makes the upper bound on the absolute value of the error less than or equal to 0.00001 would be $n = 17$ if we didn’t need an even $n$ for Simpson’s Rule, but we do. So, the smallest $n$ that we can actually use for Simpson’s Rule is $n = 18$.


We’ll write it for a final time: it is possible that a smaller $n$ actually yields an approximation within the desired amount; we just can’t determine such an $n$ from our theoretical bound.


You can see from the numbers above why Simpson’s Rule is the preferred approximation method, and why, in general, we prefer midpoint Riemann sums to the Trapezoidal Rule. All three approximations are roughly equally complicated to calculate for a fixed $n$, and yet, Simpson’s Rule with $n = 18$ gives us a bound on the error that requires $n = 159$ for midpoint Riemann sums, and $n = 224$ for the Trapezoidal Rule. This is typical.
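The three calculations of the smallest usable $n$ follow the same pattern: solve the error-bound inequality for $n$ and round up (to an even number, for Simpson’s Rule). A small Python sketch of this, assuming the bounds of Theorems 2.6.7, 2.6.8, and 2.6.10, is below. The function name `min_subintervals` and its `method` strings are hypothetical, and floating-point rounding could matter if the bound were met exactly at an integer $n$.

```python
import math


def min_subintervals(M, a, b, tol, method):
    """Smallest n for which the theoretical error bound is at most tol.

    M bounds |f''| (midpoint, trapezoid) or |f''''| (simpson) on [a, b].
    """
    length = b - a
    if method == "midpoint":
        # Need M (b-a)^3 / (24 n^2) <= tol.
        n = math.ceil(math.sqrt(M * length**3 / (24 * tol)))
    elif method == "trapezoid":
        # Need M (b-a)^3 / (12 n^2) <= tol.
        n = math.ceil(math.sqrt(M * length**3 / (12 * tol)))
    elif method == "simpson":
        # Need M (b-a)^5 / (180 n^4) <= tol, and n must be even.
        n = math.ceil((M * length**5 / (180 * tol)) ** 0.25)
        if n % 2 == 1:
            n += 1
    else:
        raise ValueError("unknown method: " + method)
    return max(n, 2 if method == "simpson" else 1)


# f(x) = 1/x^2 on [1, 2]: |f''| <= 6 and |f''''| <= 120.
print(min_subintervals(6, 1, 2, 1e-5, "midpoint"))
print(min_subintervals(6, 1, 2, 1e-5, "trapezoid"))
print(min_subintervals(120, 1, 2, 1e-5, "simpson"))
```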


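Okay, it’s time to actually calculate the approximations of $\int_1^2 \frac{1}{x^2}\,dx$ when $n = 2$ and $n = 4$.

• $n = 2$:

$\Delta x = (b-a)/2 = 1/2$, $x_0 = 1$, $x_1 = 3/2$, $x_2 = 2$.

midpoint Riemann sum:

$$\frac{1}{2}\left(\frac{1}{(5/4)^2} + \frac{1}{(7/4)^2}\right) = \frac{1}{2}\left(\frac{16}{25} + \frac{16}{49}\right) \approx 0.483265306122.$$

Trapezoidal Rule:

$$\frac{1}{4}\left(\frac{1}{1^2} + 2\cdot\frac{1}{(3/2)^2} + \frac{1}{2^2}\right) = 0.5347\overline{2}.$$

Simpson’s Rule:

$$\frac{1}{6}\left(\frac{1}{1^2} + 4\cdot\frac{1}{(3/2)^2} + \frac{1}{2^2}\right) = 0.504\overline{629}.$$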

As the actual value of the integral is 0.5, we see that the absolute values of the errors in the three cases are (to within 12 decimal places), respectively, 0.016734693878, $0.0347\overline{2}$, and $0.004\overline{629}$. Recall that the upper bounds on these, which we calculated earlier, were, respectively, 0.0625, 0.125, and $0.041\overline{6}$. What we see is that the upper bounds are certainly upper bounds, but that these upper bounds are not particularly close to the actual absolute values of the errors.


This, too, is typical.

• $n = 4$:

$\Delta x = (b-a)/4 = 1/4$, $x_0 = 1$, $x_1 = 5/4$, $x_2 = 3/2$, $x_3 = 7/4$, $x_4 = 2$.


midpoint Riemann sum:

$$\frac{1}{4}\left(\frac{1}{(9/8)^2} + \frac{1}{(11/8)^2} + \frac{1}{(13/8)^2} + \frac{1}{(15/8)^2}\right) \approx 0.495547936480.$$

Trapezoidal Rule:

$$\frac{1}{8}\left(\frac{1}{1^2} + 2\cdot\frac{1}{(5/4)^2} + 2\cdot\frac{1}{(3/2)^2} + 2\cdot\frac{1}{(7/4)^2} + \frac{1}{2^2}\right) \approx 0.508993764172.$$

Simpson’s Rule:

$$\frac{1}{12}\left(\frac{1}{1^2} + 4\cdot\frac{1}{(5/4)^2} + 2\cdot\frac{1}{(3/2)^2} + 4\cdot\frac{1}{(7/4)^2} + \frac{1}{2^2}\right) \approx 0.500417611489.$$

Here, we see that the absolute values of the errors in the three cases are (to within 12 decimal places), respectively, 0.004452063520, 0.008993764172, and 0.000417611489. The upper bounds on these, which we calculated earlier, were, respectively, 0.015625, 0.03125, and $0.0026041\overline{6}$. Again, our upper bounds are clearly upper bounds, but they are not very close to the actual absolute values of the errors.
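The whole comparison in this example can be reproduced with a short Python sketch, which prints each approximation of $\int_1^2 \frac{1}{x^2}\,dx$ next to its actual absolute error and its theoretical bound. The function and variable names are assumptions for this illustration; the bounds use $M = 6$ for $|f''|$ and $M = 120$ for $|f^{(4)}|$, as computed above.

```python
def midpoint_rule(f, a, b, n):
    dx = (b - a) / n
    return dx * sum(f(a + (k - 0.5) * dx) for k in range(1, n + 1))


def trapezoidal_rule(f, a, b, n):
    dx = (b - a) / n
    return dx / 2 * (f(a) + 2 * sum(f(a + k * dx) for k in range(1, n)) + f(b))


def simpsons_rule(f, a, b, n):
    dx = (b - a) / n  # n is assumed to be even
    weighted = sum((4 if k % 2 else 2) * f(a + k * dx) for k in range(1, n))
    return dx / 3 * (f(a) + weighted + f(b))


def f(x):
    return 1.0 / (x * x)


a, b, exact = 1.0, 2.0, 0.5
M2, M4 = 6.0, 120.0  # maxima of |f''| and |f''''| on [1, 2]

results = {}
for n in (2, 4):
    approximations = {
        "midpoint": midpoint_rule(f, a, b, n),
        "trapezoid": trapezoidal_rule(f, a, b, n),
        "simpson": simpsons_rule(f, a, b, n),
    }
    bounds = {
        "midpoint": M2 * (b - a) ** 3 / (24 * n**2),
        "trapezoid": M2 * (b - a) ** 3 / (12 * n**2),
        "simpson": M4 * (b - a) ** 5 / (180 * n**4),
    }
    for name, value in approximations.items():
        error = abs(exact - value)
        results[(name, n)] = (value, error, bounds[name])
        print(f"n={n} {name:9s} approx={value:.12f} "
              f"error={error:.12f} bound={bounds[name]:.12f}")
```

In every row, the actual error is below the bound, but not close to it, which is the point made in the text.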


Remark 2.6.13. Before leaving this section, we should make a final remark. These days, most scientific calculators can calculate definite integrals. How do they do this? They use a numerical technique in the spirit of this section, such as a form of Simpson’s Rule with many, very small, subintervals, so that the result is typically accurate to within the number of significant digits that the display can handle.


2.6.1 Exercises 


For the definite integrals in Exercises 1 - 15, calculate (a) the exact value of the definite integral, (b) a midpoint approximation, (c) a trapezoidal approximation, and (d) an approximation using Simpson’s Rule. For each of the approximations, use $n = 4$. View (a) as an opportunity to practice integration techniques from the previous chapters.


NO 
8 
Ny 
? 
= 
a 
Q 
8 


a= - 


w 
os 
fh 
° 
8 
i) 
| Ne) 
8 
ow 
8 
) ie 
(on) 
= 
fan) 
Q 
8 


4. $\int_{-4}^{4} \cosh(4s)\,ds$.


6 442 
At -_ 
5. i; ica a 


12 
3 
eee, | 
0 Vax-+4 


7. $\int_0^{\pi/6} \tan(2\theta)\,d\theta$.


8. $\int_1^5 \ln h\,dh$.


[ =32° + Be? + 62-5 
2 


2 — 22 


dz. 


10. $\int_0^6 \sqrt{36 - t^2}\,dt$.


16 
11 a d 
‘ = aM: 
12 V m2 = 9 


1 
i) / jg, 
9 (44402)? 


7 2 
2p? +1 
13. | PF dp 
3 Po +P 


20 
14. | sin? ddd. 
0 


15. $\int_{-6}^{0} \sqrt{x^2 + 12x + 45}\,dx$.

16. Prove that the midpoint approximation is in fact equal to $\int_a^b (mx + b)\,dx$ for arbitrary $n$. Hint: show $\int_{x_{k-1}}^{x_k} (mx + b)\,dx$ is equal to the area of the rectangle approximating it.


Determine $n$, the number of intervals “necessary” to give an approximation of the definite integral using the specified method with absolute error less than that given.

17. $\int_0^{\pi/4} \sin(4\theta)\,d\theta$, Midpoint, Error $< 0.001$.


18. $\int_0^{\pi/4} \sin(4\theta)\,d\theta$, Trapezoid, Error $< 0.001$.


19. $\int_{5134}^{14238} \left(97x^3 - 56\pi x^2 + 42ex - 500\right) dx$, Simpson’s Rule, Error $< 0.000001$.


Estimate each of the following unpleasant definite integrals using (a) the Midpoint 
Rule, (b) the Trapezoidal Rule and (c) Simpson’s Rule. 


5 oa 
20. ‘i © dr,n=4. 
1 x 


21. / sin(sint) dt. n = 6. 
0 


22. $\int_0^{\pi/2} \sin(y^2)\,dy$, $n = 8$. Note: this is one of the Fresnel integrals, introduced in the exercises of the previous section.


23. 10. 


a dz 
SS DES 
0 VztA+1 


1/2 
24. / dE) ie n=6. 
0 1l+a 


25. Use the fact that $\ln a = \int_1^a \frac{dt}{t}$ and the Trapezoidal Rule with $n = 10$ to approximate $\ln 2$.


26. Redo the previous problem using $n = 10$ and Simpson’s Rule.


27. Use the formula $\frac{\pi}{4} = \int_0^1 \frac{dy}{1 + y^2}$ to estimate $\pi$. Use the Trapezoidal Rule with $n = 10$.


28. Redo the previous problem using $n = 10$ and Simpson’s Rule.
In Exercises 29 - 32, calculate an upper bound for the absolute error of the definite 


integral in terms of n when approximated with (a) the Midpoint Method and (b) 
the Trapezoidal Rule. 


29.


30. 


3l. 


32. 


33. 


34. 


35. 


[4% 

1 yo 

27 

i sin? t dt. 
0 

12 

| e* dz. 
0 


1 
a e~” dx. Hint: To find the maximum absolute value of the derivatives, note that all 
-1 


derivatives are of the form p(x)e~® 


where p(x) is a polynomial. It takes some fortitude to 
factor these polynomials and correctly apply the first or second derivative tests, but some 


of the roots are obvious. 
3 
What is the upper bound on the absolute error of / e® dx when estimated with Simp- 
0 
son’s Rule? Leave your answer in terms of n. Hint: even though Simpson’s Rule requires 
more derivative taking than the Midpoint Method, this problem is easier than the pre- 


vious problem since the polynomial appearing in the 4th derivative is easier to analyze. 


Generalize the previous problem. Let E(n,b) be the upper bound of Simpson’s approx- 
b 


imation of J e® dx when (0, b] is subdivided into n equally spaced intervals. What is 
E(n,b)? 


b 


Prove that if n = 4, the Simpson’s approximation of / x dx is exact. 
a 


A random variable is said to follow the standard normal distribution if it is normally distributed with mean zero and standard deviation one ($\mu = 0$, $\sigma = \sigma^2 = 1$). Use this information to solve Exercises 36 - 40.

36. Show that the density function of a standard normal distribution is $\frac{1}{\sqrt{2\pi}}\,e^{-x^2/2}$.

37. a. Suppose a data set follows a standard normal distribution. Use the Midpoint Rule with $n = 4$ to approximate the proportion of data between 0 and 1.

b. Based on your result in (a), approximately how much of the data falls between $-1$ and 1? Hint: the density function is symmetric about the $y$-axis.

38. a. Suppose a data set follows a standard normal distribution. Use the Trapezoidal Rule with $n = 4$ to approximate the proportion of data between 0 and 2.

b. Based on your result in (a), approximately how much of the data falls between $-2$ and 2?

39. Suppose that the mature height of a certain tree species is normally distributed with mean 16 feet and standard deviation of 2 feet. What proportion of trees are between 14 and 18 feet tall? Hint: given a normal distribution with parameters $\mu$ and $\sigma$, make the substitution $z = \frac{x - \mu}{\sigma}$ before evaluating. You should be able to use one of the prior two problems to help answer this question.

40. Suppose a test is administered to gauge the effect of alcohol on driving ability. A large sample of individuals is given one alcoholic beverage, and their response time to a certain stimulus is measured. Suppose the response time is normally distributed with mean three seconds and standard deviation 0.6 seconds. What percentage of the individuals’ response times fall between 1.8 and 4.2 seconds?


Up to this point, we’ve used the various methods in this chapter to approximate 


definite integrals of a given continuous, or at least integrable, function. Since these 


approximations depend only on knowledge of the function at finitely many points, 


they may be adapted to applications where a definite integral is required, but the 


integrand is known only at finitely many points. 
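As a minimal sketch of this idea, the following Python function applies the Trapezoidal Rule directly to a table of sample points, without any formula for the integrand. The function name `trapezoid_from_samples` and the data are our own illustrative assumptions; the data are samples of $y = x^2$ on $[0, 2]$, chosen only so that the estimate can be compared with the exact value $8/3$.

```python
def trapezoid_from_samples(xs, ys):
    """Trapezoidal estimate of an integral from tabulated points.

    xs must be increasing; ys[i] is the measured value at xs[i].
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two matching samples")
    total = 0.0
    for i in range(1, len(xs)):
        # area of the trapezoid over [xs[i-1], xs[i]]
        total += 0.5 * (ys[i - 1] + ys[i]) * (xs[i] - xs[i - 1])
    return total


# Hypothetical measurements (not taken from the exercises): samples of y = x^2.
xs = [0.0, 0.5, 1.0, 1.5, 2.0]
ys = [x * x for x in xs]
estimate = trapezoid_from_samples(xs, ys)
print(estimate)  # 2.75, compared with the exact value 8/3
```

Only the tabulated values enter the computation, which is why the same approach applies when the data come from measurements.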


41. Recall that if the force exerted on an object $x$ meters from the origin is given by $f(x)$, then the work done in moving the object from $a$ to $b$ meters from the origin is $W = \int_a^b f(x)\,dx$. Suppose a 3 kilogram mass is being moved from 9 to 17 meters from the origin and the acceleration of the particle is measured every 2 meters, in meters per second per second, as shown in the table below. Use the Midpoint Method to approximate the total work done.

x  | Acceleration
10 | 2.5
12 | 5
14 | 2
16 | -1

42. Suppose that a device is placed in a race car that measures the instantaneous velocity at two second intervals. The data are shown below. Use Simpson’s Rule to estimate the total distance traveled by the car between $t = 0$ and $t = 14$ seconds.

Time (t) | Velocity (miles/hour)
0        | 0
2        | 15
4        | 40
6        | 66
8        | 82
10       | 92
12       | 108
14       | 116

43. Redo the previous problem using the Trapezoidal Rule.


Estimate each of the following definite integrals using the specified method and number of partitions. Then make a conjecture about the value of the (possibly improper) integral. As a side note, each of these integrals has the property that it can be calculated directly using a powerful tool from complex analysis called residues.


100 
1 
44, i ee dx, Midpoint Method, n = 10. Make a conjecture regarding the value of 
0 x 


ae 
es 
9 l+2 


45. / = = dz, Trapezoidal Rule, n = 10. Make a conjecture regarding the value of 
=10 4°91 4 


+ 3cost 


[ dt 
_, D+3cost 


An ordinary or simple pendulum is one in which the path of the pendulum is 


dt 
A6. / 54 3cost? Simpson’s Rule, n = 8. Make a conjecture regarding the exact value of 


assumed to trace out a portion of a circle. This contrasts with a cycloidal pendulum, where the path is a cycloid. Assuming no friction, the period $T$ of an ordinary pendulum can be stated in terms of the elliptic integral

$$T = 2\sqrt{\frac{L}{g}} \int_{-1}^{1} \frac{dw}{\sqrt{(1 - w^2)\left(1 - w^2\sin^2(A/2)\right)}},$$

where $L$ is the radius of the path, $g$ is the acceleration due to gravity, and $A$ is the amplitude of oscillation. That is, $A$ measures the angular position of the pendulum when it is first dropped.

47. Suppose $A = \pi/2$, $g = 9.8$ m/sec$^2$ and $L = 1$ meter. Use Simpson’s Rule with $n = 4$ to estimate the period. As the integral is improper, make your estimation on the interval $[-.95, .95]$.

48. Redo the previous problem with $A = \pi/4$.

49. Argue that if the amplitude is small, the period can be approximated by

$$2\sqrt{\frac{L}{g}} \int_{-1}^{1} \frac{dw}{\sqrt{1 - w^2}}.$$

This expression is therefore independent of the amplitude. Evaluate this integral directly using $L = 1$ meter.

50. Is your answer to the previous problem closer to the Simpson’s approximation with $A = \pi/2$ or $A = \pi/4$? Why does this make sense?

Appendix 2.A Technical Matters


In this section, we want to describe the technical framework for proving many of the theorems 
on definite integrals. Still, there are many details beyond the scope of what we want to present 


even in this technical appendix; our primary reference for these details is [4]. 


Theorem 2.A.1. The limit of Riemann sums, the definite integral, given in Definition 2.3.1 is unique, if it exists.

In addition, if $f$ is Riemann integrable on the interval $[a,b]$, $\int_a^b f(x)\,dx = L$, and $(P_n, S_n)$ is a sequence of sampled partitions of $[a,b]$ such that $\lim_{n\to\infty} \|P_n\| = 0$, then

$$\lim_{n\to\infty} R_{P_n}^{S_n}(f) = L.$$

Proof. Clearly, proving the sequence statement proves the entire theorem.

Suppose that $\int_a^b f(x)\,dx$ exists and equals $L$, and suppose that $(P_n, S_n)$ is a sequence of sampled partitions of $[a,b]$ such that $\lim_{n\to\infty} \|P_n\| = 0$.

Let $\epsilon > 0$. Then, by definition of the Riemann integral, there exists $\delta > 0$ such that, for all partitions $P = \{x_0, \dots, x_n\}$ of $[a,b]$ such that $\|P\| < \delta$, and for all sample sets $\{s_1, \dots, s_n\}$ for $P$,

$$\left| \sum_{i=1}^{n} f(s_i)\,\Delta x_i - L \right| < \epsilon.$$

As $\lim_{n\to\infty} \|P_n\| = 0$, there exists a positive integer $N$ such that, for all $n \geq N$, $\|P_n\| < \delta$.

Hence, for all $n \geq N$, if $P_n = \{x_0, \dots, x_n\}$ and $S_n = \{s_1, \dots, s_n\}$,

$$\left| \sum_{i=1}^{n} f(s_i)\,\Delta x_i - L \right| < \epsilon,$$

that is,

$$\left| R_{P_n}^{S_n}(f) - L \right| < \epsilon.$$


Before stating any further results, we remark that unbounded functions (see Definition 2.3.4) are not Riemann integrable, as an earlier theorem tells us. Consequently, below, we will be concerned with the case where $f$ is a bounded function, until we get to the results on improper integrals.


It is a defining property of the real numbers that every non-empty set of real numbers which is bounded above has a least upper bound, or supremum. It follows, by negating, that every non-empty set of real numbers which is bounded below has a greatest lower bound, or infimum. For a bounded set $V$ of real numbers, you should think of the supremum and infimum as the numbers that “want” to be the maximum and minimum values, respectively, in $V$, though neither the supremum nor the infimum need actually be contained in $V$.

Now suppose that $f$ is a bounded function (see Definition 2.3.4) on a set $E$ of real numbers. This means that the set $V = f(E) = \{f(x) \mid x \in E\}$ is bounded, and so possesses a supremum and infimum; we denote these, respectively, by $M_f(E)$ and $m_f(E)$.

The Extreme Value Theorem (see [2] or [4]) tells us that a continuous function $f$ on a closed interval $[a,b]$ is bounded and that, in fact, $M_f([a,b])$ and $m_f([a,b])$ are in the set $f([a,b])$, so that they are, respectively, the maximum and minimum values of $f$ on $[a,b]$.


Now, suppose that $f$ is bounded on the closed interval $[a,b]$, and that $P = (x_0, \dots, x_n)$ is a partition of $[a,b]$. We would like to pick sample points which maximize or minimize the Riemann sums, but such sample points need not exist (though they would if $f$ were continuous). However, we can “fake” it by using the supremum and infimum of $f$ on each subinterval of the partition. Thus, we define the upper and lower sums $U_f(P)$ and $L_f(P)$, respectively, of $f$ with respect to the partition $P$ to be

$$U_f(P) = \sum_{i=1}^{n} M_f([x_{i-1}, x_i])\,\Delta x_i$$

and

$$L_f(P) = \sum_{i=1}^{n} m_f([x_{i-1}, x_i])\,\Delta x_i.$$

It follows immediately that $U_f(P)$ is the least upper bound of all of the Riemann sums of $f$, using the partition $P$, and that $L_f(P)$ is the greatest lower bound of all of the Riemann sums of $f$, using the partition $P$.


Note that it is immediate that, if a partition $P^*$ is a refinement of a partition $P$, then

$$U_f(P^*) - L_f(P^*) \leq U_f(P) - L_f(P).$$

The fundamental integrability result that we need, for bounded functions, is the following, which we state without proof; see Theorem 3.2.7 of [4].

Theorem 2.A.2. Suppose that $f$ is bounded on the interval $[a,b]$. Then, $f$ is Riemann integrable on $[a,b]$ if and only if, for all $\epsilon > 0$, there exists a partition $P$ of $[a,b]$ such that

$$U_f(P) - L_f(P) < \epsilon.$$
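The criterion in Theorem 2.A.2 can be explored numerically. The sketch below is only an illustration under a simplifying assumption: the function is increasing, so that its supremum on each subinterval is the value at the right endpoint and its infimum is the value at the left endpoint. The helper names `upper_lower_sums` and `partition_size_for` are our own. For $f(x) = x^2$ on $[0,1]$, the code doubles the number of equal subintervals until the gap between the upper and lower sums drops below a given $\epsilon$.

```python
def upper_lower_sums(f, a, b, n):
    """Upper and lower sums of an INCREASING function f on [a, b],
    using the partition into n subintervals of equal length."""
    dx = (b - a) / n
    xs = [a + i * dx for i in range(n + 1)]
    # For increasing f: infimum at the left endpoint, supremum at the right.
    lower = sum(f(xs[i - 1]) * dx for i in range(1, n + 1))
    upper = sum(f(xs[i]) * dx for i in range(1, n + 1))
    return upper, lower


def partition_size_for(f, a, b, eps):
    """Double n until the uniform partition satisfies U - L < eps."""
    n = 1
    while True:
        upper, lower = upper_lower_sums(f, a, b, n)
        if upper - lower < eps:
            return n, upper, lower
        n *= 2


n, upper, lower = partition_size_for(lambda x: x * x, 0.0, 1.0, 0.01)
print(n, lower, upper)  # the exact integral 1/3 lies between lower and upper
```

For an increasing function the gap on a uniform partition equals $(f(b) - f(a))\,\Delta x$, so here it is $1/n$, and the search stops at the first power of 2 exceeding 100.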


Given Theorem 2.A.2, the integrability of continuous functions follows almost immediately from a basic fact: continuous functions on closed bounded intervals $[a,b]$ are uniformly continuous. That is:

Theorem 2.A.3. If $f$ is continuous on the closed interval $[a,b]$, then, for all $\epsilon > 0$, there exists $\delta > 0$ such that, if $x_1$ and $x_2$ are in $[a,b]$ and $|x_1 - x_2| < \delta$, then $|f(x_1) - f(x_2)| < \epsilon$.

Proof. See Theorem 2.2.12 of [4].

The “uniform” in “uniformly continuous” refers to the fact that, for a given $\epsilon > 0$, there is a uniform $\delta > 0$ that works everywhere in the interval; normal continuity allows $\delta$ to vary as you change one of the points.


It is now easy to prove: 


Theorem 2.A.4. Suppose that f is continuous on the interval [a,b], where a < b. Then, f is 


Riemann integrable on [a, b]. 


Proof. Let $\epsilon > 0$. Then, $\epsilon/(b-a) > 0$, and so, by Theorem 2.A.3, there exists $\delta > 0$ such that, if $x_1$ and $x_2$ are in $[a,b]$ and $|x_1 - x_2| < \delta$, then $|f(x_1) - f(x_2)| < \epsilon/(b-a)$.

Let $P = (x_0, \dots, x_n)$ be a partition of $[a,b]$ with mesh $< \delta$. We claim that $U_f(P) - L_f(P) < \epsilon$, which would prove that $f$ is Riemann integrable by Theorem 2.A.2.

As $\|P\| < \delta$, for any two points $a_i$ and $b_i$ in the subinterval $[x_{i-1}, x_i]$, $|a_i - b_i| < \delta$. By considering the points in the subinterval where $f$ attains its maximum and minimum values, we conclude that $M_f([x_{i-1}, x_i]) - m_f([x_{i-1}, x_i]) < \epsilon/(b-a)$ and, hence, that

$$\sum_{i=1}^{n} \big(M_f([x_{i-1}, x_i]) - m_f([x_{i-1}, x_i])\big)\,\Delta x_i < \sum_{i=1}^{n} \frac{\epsilon}{b-a}\,\Delta x_i = \frac{\epsilon}{b-a} \sum_{i=1}^{n} \Delta x_i = \epsilon,$$

i.e., that

$$U_f(P) - L_f(P) < \epsilon.$$

We can now prove Theorem 


Theorem 2.A.5. Bounded, piecewise-continuous functions on closed intervals are Riemann 


integrable. 


Proof. Divide the interval $[a,b]$ into subintervals such that each subinterval has a discontinuity at, at most, one endpoint. By Theorem 2.3.16, it is enough for us to prove that $f$ is Riemann integrable on such subintervals. Thus, we will assume that $f$ is bounded on $[a,b]$, with $|f(x)| \leq B$ for $x$ in $[a,b]$, and has a single discontinuity at $a$. Note that, if $B = 0$, $f$ is identically 0 on $[a,b]$, and we’re finished; so assume that $B > 0$.

Now, let $\epsilon > 0$. Consider first the partition of $[a,b]$ given by $P = (a, a + \epsilon/(4B), b)$. As $f$ is continuous on $[a + \epsilon/(4B), b]$, $f$ is integrable on this subinterval, and so, by Theorem 2.A.2, there exists a partition $\widetilde{P} = (x_0, \dots, x_n)$ of $[a + \epsilon/(4B), b]$ such that $U_f(\widetilde{P}) - L_f(\widetilde{P}) < \epsilon/2$. Let $P^*$ be the partition of $[a,b]$ given by $(a, x_0, \dots, x_n)$.

Then, from the definitions, it follows at once that

$$U_f(P^*) = M_f([a, x_0]) \cdot \frac{\epsilon}{4B} + U_f(\widetilde{P})$$

and

$$L_f(P^*) = m_f([a, x_0]) \cdot \frac{\epsilon}{4B} + L_f(\widetilde{P}).$$

Also, note that $M_f([a, x_0]) \leq B$, while $m_f([a, x_0]) \geq -B$, so that $-m_f([a, x_0]) \leq B$, and $M_f([a, x_0]) - m_f([a, x_0]) \leq 2B$.

Therefore,

$$U_f(P^*) - L_f(P^*) = M_f([a, x_0]) \cdot \frac{\epsilon}{4B} + U_f(\widetilde{P}) - m_f([a, x_0]) \cdot \frac{\epsilon}{4B} - L_f(\widetilde{P}) =$$

$$\big(M_f([a, x_0]) - m_f([a, x_0])\big) \cdot \frac{\epsilon}{4B} + \big[U_f(\widetilde{P}) - L_f(\widetilde{P})\big] < \frac{\epsilon}{2} + \frac{\epsilon}{2} = \epsilon,$$

and we are finished by Theorem 2.A.2.


Theorem 2.A.6. Suppose that $a < b$.

1. If $f$ is Riemann integrable on $[a,b]$ and $a \leq c < d \leq b$, then $f$ is Riemann integrable on $[c,d]$, i.e., if $f$ is Riemann integrable on a given closed interval, then $f$ is Riemann integrable on any closed subinterval of the given interval.

2. Suppose that $a < c < b$. Then, $f$ is Riemann integrable on $[a,b]$ if and only if $f$ is Riemann integrable on $[a,c]$ and $[c,b]$ and, when these equivalent conditions hold,

$$\int_a^b f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx.$$

Proof. Suppose that $f$ is Riemann integrable on $[a,b]$ and $a \leq c < d \leq b$. As Riemann integrable functions are bounded, $f$ must be bounded on $[a,b]$ and, hence, is bounded on $[c,d]$.

Let $\epsilon > 0$. By Theorem 2.A.2, there exists a partition $P$ of $[a,b]$ such that $U_f(P) - L_f(P) < \epsilon$. Now, let $P^*$ be the partition of $[c,d]$ formed from the points in $[c,d] \cap P$, together with $c$ and $d$. Then,

$$U_f(P^*) - L_f(P^*) \leq U_f(P) - L_f(P) < \epsilon,$$

and so, by Theorem 2.A.2, $f$ is integrable on $[c,d]$.

Suppose now that $a < c < b$.

If $f$ is Riemann integrable on $[a,b]$, then part (1) tells us that $f$ is Riemann integrable on $[a,c]$ and $[c,b]$.

Suppose that $f$ is Riemann integrable on $[a,c]$ and $[c,b]$. Then, $f$ must be bounded on $[a,c]$ and $[c,b]$ and, hence, on $[a,b]$. Let $\epsilon > 0$. Then, $\epsilon/2 > 0$. By Theorem 2.A.2, there exist partitions $P^1$ and $P^2$ of $[a,c]$ and $[c,b]$, respectively, such that $U_f(P^1) - L_f(P^1) < \epsilon/2$ and $U_f(P^2) - L_f(P^2) < \epsilon/2$. Then, the partition $P^* = P^1 \cup P^2$ of $[a,b]$ is such that

$$U_f(P^*) - L_f(P^*) < \frac{\epsilon}{2} + \frac{\epsilon}{2} = \epsilon.$$

Therefore, by Theorem 2.A.2, $f$ is Riemann integrable on $[a,b]$.

Now, suppose that $f$ is Riemann integrable on $[a,c]$ and $[c,b]$ and, hence, by the above, Riemann integrable on $[a,b]$. Let $L_1 = \int_a^c f(x)\,dx$, $L_2 = \int_c^b f(x)\,dx$, and $L = \int_a^b f(x)\,dx$. For each integer $n \geq 1$, let $P_n^1$ and $P_n^2$ be partitions of $[a,c]$ and $[c,b]$, respectively, with mesh $< 1/n$. Let $S_n^1$ and $S_n^2$ be sample sets for the partitions $P_n^1$ and $P_n^2$, respectively. Let $P_n = P_n^1 \cup P_n^2$ and $S_n = S_n^1 \cup S_n^2$. Then, $P_n$ is a partition of $[a,b]$ of mesh $< 1/n$, and $S_n$ is a sample set for $P_n$ (possibly using the point $c$ as a sample point for two different subintervals).

Using Theorem 2.A.1 three times, we find

$$L_1 + L_2 = \lim_{n\to\infty} R_{P_n^1}^{S_n^1}(f) + \lim_{n\to\infty} R_{P_n^2}^{S_n^2}(f) = \lim_{n\to\infty} R_{P_n}^{S_n}(f) = L.$$

Theorem 2.A.7. (Linearity of Integration) Definite integration over a closed interval is a linear operation, i.e., if $f$ and $g$ are Riemann integrable on $[a,b]$, then, for all constants $r$ and $s$, the function $rf + sg$ is Riemann integrable on $[a,b]$, and

$$\int_a^b \big(rf(x) + sg(x)\big)\,dx = r\int_a^b f(x)\,dx + s\int_a^b g(x)\,dx.$$

Proof. Suppose that $f$ and $g$ are Riemann integrable on $[a,b]$. We will prove that $f + g$ is Riemann integrable on $[a,b]$ and that

$$\int_a^b \big(f(x) + g(x)\big)\,dx = \int_a^b f(x)\,dx + \int_a^b g(x)\,dx.$$

That constants can be pulled out of integrals is left to you in Exercise 52.

Let $[c,d]$ be any closed subinterval of $[a,b]$. It is trivial to see that

$$M_{f+g}([c,d]) \leq M_f([c,d]) + M_g([c,d])$$

and

$$m_f([c,d]) + m_g([c,d]) \leq m_{f+g}([c,d]).$$

Therefore, for any partition $P$ of $[a,b]$, $U_{f+g}(P) \leq U_f(P) + U_g(P)$ and $L_f(P) + L_g(P) \leq L_{f+g}(P)$; hence,

$$U_{f+g}(P) - L_{f+g}(P) \leq \big[U_f(P) - L_f(P)\big] + \big[U_g(P) - L_g(P)\big].$$

It follows immediately from Theorem 2.A.2 that $f + g$ is integrable on $[a,b]$.

Now, let $P_n$ be a partition of $[a,b]$ of mesh $< 1/n$, and $S_n$ be a sample set for $P_n$. Using Theorem 2.A.1 three times, we find

$$\int_a^b \big(f(x) + g(x)\big)\,dx = \lim_{n\to\infty} R_{P_n}^{S_n}(f+g) = \lim_{n\to\infty} R_{P_n}^{S_n}(f) + \lim_{n\to\infty} R_{P_n}^{S_n}(g) = \int_a^b f(x)\,dx + \int_a^b g(x)\,dx.$$

Using Theorem 2.3.19 (see Theorem 3.3.1 of [4]), it is easy to conclude Theorem 2.3.14.

Theorem 2.A.8. Suppose that $f$ and $g$ are defined on a closed interval $[a,b]$, and that, except possibly for a finite set of points in $[a,b]$, $f$ and $g$ are equal at each point in $[a,b]$.

Then, $f$ is Riemann integrable on $[a,b]$ if and only if $g$ is, and when $f$ and $g$ are Riemann integrable,

$$\int_a^b f(x)\,dx = \int_a^b g(x)\,dx.$$

Proof. As $f$ and $g$ differ at, at most, a finite number of points, $f$ is bounded if and only if $g$ is bounded. If both functions are unbounded, then neither function is Riemann integrable, since unbounded functions are not Riemann integrable.

Now, suppose that $f$ and $g$ are both bounded, and that $f$ is Riemann integrable. Then, $g - f$ is bounded, and is equal to 0 except at, possibly, a finite number of points, $r_1, \dots, r_p$. We claim that $\int_a^b (g - f)(x)\,dx = 0$; if we show this, we are finished by Theorem 2.3.19, since we would then have

$$\int_a^b f(x)\,dx = \int_a^b f(x)\,dx + \int_a^b (g - f)(x)\,dx = \int_a^b g(x)\,dx.$$

Let $M$ be the maximum of $|(g - f)(r_i)|$ for $1 \leq i \leq p$. Let $P = (x_0, \dots, x_n)$ be a partition of the interval $[a,b]$, and let $S = (s_1, \dots, s_n)$ be a set of sample points for $P$.

At most $2p$ of the sample points lie in $\{r_1, \dots, r_p\}$ (a point $r_j$ can serve as the sample point for two adjacent subintervals), and $(g - f)(s_i) = 0$ for every other sample point. Then,

$$\left| \sum_{i=1}^{n} (g - f)(s_i)\,\Delta x_i \right| \leq \sum_{i=1}^{n} \big[\,|(g - f)(s_i)| \cdot \Delta x_i\big] \leq 2pM \cdot \|P\|,$$

and, hence,

$$\lim_{\|P\| \to 0} R_P^S(g - f) = 0.$$


Theorem 2.A.9.

Theorem 2.A.10. (Fundamental Theorem of Calculus, Part 1) Suppose that $f$ is Riemann integrable on $[a,b]$ and is continuous at a point $x_0$ in $(a,b)$. Then, the integral function $I_f^{[a,b]}$ of $f$ on $[a,b]$, given by $I_f^{[a,b]}(x) = \int_a^x f(t)\,dt$, is differentiable at $x_0$ and

$$\big(I_f^{[a,b]}\big)'(x_0) = f(x_0).$$

Thus, if $f$ is continuous on $[a,b]$, then $I_f^{[a,b]}$ is an anti-derivative of $f$ on $[a,b]$.

Proof. From the definition of the derivative, we have

$$\big(I_f^{[a,b]}\big)'(x_0) = \lim_{x \to x_0} \frac{I_f^{[a,b]}(x) - I_f^{[a,b]}(x_0)}{x - x_0} = \lim_{x \to x_0} \frac{\int_a^x f(t)\,dt - \int_a^{x_0} f(t)\,dt}{x - x_0} = \lim_{x \to x_0} \frac{\int_{x_0}^x f(t)\,dt}{x - x_0}.$$

We must show that this limit equals $f(x_0)$. We shall show that the limit from the right equals $f(x_0)$, and leave the nearly identical argument from the left as an exercise.

Let $\epsilon > 0$. By the definition of $f$ being continuous at $x_0$, there exists $\delta > 0$ such that, for all $x$ such that $|x - x_0| < \delta$, $|f(x) - f(x_0)| < \epsilon$. We claim that, if $0 < x - x_0 < \delta$, then

$$\left| \frac{\int_{x_0}^x f(t)\,dt}{x - x_0} - f(x_0) \right| \leq \epsilon, \tag{2.4}$$

which is enough to show that $\lim_{x \to x_0^+} \frac{\int_{x_0}^x f(t)\,dt}{x - x_0} = f(x_0)$.

To prove the claim in Formula 2.4, fix an $x$ such that $0 < x - x_0 < \delta$. Then, for all $t$ in $[x_0, x]$, we have $0 \leq t - x_0 \leq x - x_0 < \delta$, and so, $|t - x_0| < \delta$. Using the defining property of $\delta$ again, we conclude that, for all $t$ in $[x_0, x]$, $|f(t) - f(x_0)| < \epsilon$.

Therefore, for all $t$ in $[x_0, x]$,

$$f(x_0) - \epsilon < f(t) < f(x_0) + \epsilon.$$

By the monotonicity of integration, Theorem 2.3.20, and Proposition 2.3.12, we conclude that

$$(f(x_0) - \epsilon)(x - x_0) = \int_{x_0}^x (f(x_0) - \epsilon)\,dt \leq \int_{x_0}^x f(t)\,dt \leq \int_{x_0}^x (f(x_0) + \epsilon)\,dt = (f(x_0) + \epsilon)(x - x_0).$$

Hence,

$$f(x_0) - \epsilon \leq \frac{\int_{x_0}^x f(t)\,dt}{x - x_0} \leq f(x_0) + \epsilon,$$

i.e.,

$$\left| \frac{\int_{x_0}^x f(t)\,dt}{x - x_0} - f(x_0) \right| \leq \epsilon,$$

as we claimed in Formula 2.4.

The final conclusion of the theorem follows from the first one, together with the continuity of $I_f^{[a,b]}$, which follows from Theorem 2.4.3.
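The one-sided limit in the proof can be watched numerically. The following sketch is an illustration only, not part of the proof: it takes $f = \cos$ and $x_0 = 1$ (our own choices), approximates $\int_{x_0}^{x_0+h} f(t)\,dt$ with a composite Simpson's Rule, and divides by $h$. The function names are hypothetical helpers.

```python
import math


def simpson(f, a, b, n=200):
    """Composite Simpson's Rule with n (even) subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def right_difference_quotient(f, x0, h):
    """(I(x0 + h) - I(x0)) / h, where I is the integral function of f.

    The difference I(x0 + h) - I(x0) is the integral of f over [x0, x0 + h].
    """
    return simpson(f, x0, x0 + h) / h


x0 = 1.0
errors = [abs(right_difference_quotient(math.cos, x0, h) - math.cos(x0))
          for h in (0.1, 0.01, 0.001)]
print(errors)  # the errors shrink as h decreases
```

The quotient is the average value of $f$ over $[x_0, x_0 + h]$, which is squeezed toward $f(x_0)$ exactly as in the inequalities above.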


Theorem 2.A.11. The integral defined in Definition 2.5.11 is independent of the splitting into subintervals (of the type described in the definition).

Proof. We will prove a lemma, from which the theorem follows immediately.

Suppose that $f$ is continuous on the interval $I = [a,b)$. Let $c$ be such that $a < c < b$. We claim that $\int_a^b f(x)\,dx$ exists if and only if $\int_c^b f(x)\,dx$ exists and, in this case,

$$\int_a^b f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx.$$

This is easy:

$$\int_a^b f(x)\,dx = \lim_{d \to b^-} \int_a^d f(x)\,dx = \lim_{d \to b^-} \left[ \int_a^c f(x)\,dx + \int_c^d f(x)\,dx \right] =$$

$$\int_a^c f(x)\,dx + \lim_{d \to b^-} \int_c^d f(x)\,dx = \int_a^c f(x)\,dx + \int_c^b f(x)\,dx.$$

Theorem 2.A.12. Let $I$ be an interval of the form $[a,b)$ or $(a,b]$, where we allow the intervals $[a,\infty)$ or $(-\infty,b]$. Suppose that, for all $x$ in $I$, $f(x) \geq 0$, and that, for all closed intervals $[c,d]$ contained in $I$, $f$ is Riemann integrable on $[c,d]$.

Then, if there exists a real number $M$ (an upper bound) such that, for all $[c,d]$ contained in $I$, $\int_c^d f(x)\,dx \leq M$, then $\int_a^b f(x)\,dx$ converges, and what it converges to is less than, or equal to, $M$; in other words, if there is an upper bound $M$ on all of the $\int_c^d f(x)\,dx$, then $\int_a^b f(x)\,dx$ converges to the least such upper bound.

In particular, if $\int_a^b f(x)\,dx$ diverges, what it diverges to is $\infty$.

Proof. We will deal with the case where $I = [a,b)$; the other cases are entirely analogous.

For $a \leq d < b$, define the function $F(d) = \int_a^d f(x)\,dx$, which exists by assumption. As $f \geq 0$, $F$ is an increasing function on the interval $I$.

If $F$ gets unboundedly large, then, since $F$ is increasing, $\lim_{d \to b^-} F(d) = \infty$. If $F$ is bounded above by $M$, then, as $F$ is increasing, $F$ converges to the least upper bound of the set $F(I)$, which is $\leq M$.


Lemma 2.A.13. Let $q(x) = c_0 + c_1 x + c_2 x^2 + c_3 x^3$, where $c_i$ denotes a constant. Then,

$$\int_{-h}^{h} q(x)\,dx = \frac{h}{3}\left(6c_0 + 2c_2 h^2\right) = \frac{h}{3}\big(q(-h) + 4q(0) + q(h)\big).$$

Proof. This is a trivial calculation, which we leave to the reader.
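The lemma can be spot-checked with exact rational arithmetic. This sketch does not replace the calculation left to the reader; it only confirms the identity for one arbitrarily chosen cubic. The coefficient values, the value of $h$, and the function names are our own illustrative choices.

```python
from fractions import Fraction


def exact_integral_cubic(c, h):
    """Exact integral over [-h, h] of c[0] + c[1]*x + c[2]*x^2 + c[3]*x^3.

    The odd powers integrate to zero over the symmetric interval.
    """
    return 2 * c[0] * h + Fraction(2, 3) * c[2] * h**3


def simpson_three_point(c, h):
    """The expression (h/3) * (q(-h) + 4 q(0) + q(h)) from the lemma."""
    def q(x):
        return c[0] + c[1] * x + c[2] * x**2 + c[3] * x**3
    return h / 3 * (q(-h) + 4 * q(0) + q(h))


# Arbitrary illustrative coefficients c_0, c_1, c_2, c_3 and half-width h.
coeffs = [Fraction(5), Fraction(-2), Fraction(7, 3), Fraction(11)]
h = Fraction(3, 2)
print(exact_integral_cubic(coeffs, h) == simpson_three_point(coeffs, h))
```

Because `Fraction` arithmetic is exact, agreement here is equality and not merely agreement up to rounding.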


Proposition 2.A.14. Let $q(x) = ax^2 + bx + c$. Fix $x_0$. Let $h > 0$. Let $x_1 = x_0 + h$, $x_2 = x_1 + h$, and $y_i = q(x_i)$, for $i = 0, 1, 2$. Then,

$$\int_{x_0}^{x_2} q(x)\,dx = \frac{h}{3}\left(y_0 + 4y_1 + y_2\right).$$

Proof. One can make the substitution $u = x - (x_0 + h)$ into the integral and reduce oneself to the case in Lemma 2.A.13, or do a messy, but simple, algebra problem. One has to show that

$$\frac{a(x_0 + 2h)^3}{3} + \frac{b(x_0 + 2h)^2}{2} + c(x_0 + 2h) - \frac{a x_0^3}{3} - \frac{b x_0^2}{2} - c x_0 =$$

$$\frac{h}{3}\Big\{ \big[a x_0^2 + b x_0 + c\big] + 4\big[a(x_0 + h)^2 + b(x_0 + h) + c\big] + \big[a(x_0 + 2h)^2 + b(x_0 + 2h) + c\big] \Big\}.$$

We leave either verification as an exercise.


In the proofs below, for notational convenience, we apply the inequalities above in the case 


where a = 0; the general case follows at once by making the substitution u = x — a. 


Theorem 2.A.15. Suppose that $a < b$ and the midpoint Riemann sum with $n$ subintervals of equal length $\Delta x$ is used to approximate $\int_a^b f(x)\,dx$.

If $f''(x)$ exists for all $x$ in some open interval containing the closed interval $[a,b]$ and, if there exists a number $M > 0$ such that, for all $x$ in $[a,b]$, $|f''(x)| \leq M$, then the absolute value of the error, $E_{\text{midpt}}$, satisfies the inequality

$$|E_{\text{midpt}}| \leq \frac{M(\Delta x)^3 n}{24} = \frac{M(b-a)^3}{24n^2}.$$

Proof. We shall prove this for one subinterval, centered at 0. The general result follows by re-centering the intervals, via substitution, and then adding, which leads to the multiplication by $n$ in the inequality. Let $h = \Delta x/2$.

We will show that

$$\left| \int_{-h}^{h} f(x)\,dx - 2hf(0) \right| \leq \frac{Mh^3}{3}.$$

We must look ahead to the Taylor-Lagrange Theorem, Theorem 4.3.3, which immediately implies that: if $f$ is twice-differentiable on an open interval $J$ around a point $a$, and $M > 0$ is a constant such that, for all $x$ in $J$, $|f''(x)| \leq M$, then, for all $x$ in $J$,

$$\big|f(x) - \big(f(a) + f'(a)(x - a)\big)\big| \leq \frac{M}{2}(x - a)^2. \tag{2.5}$$

Now the proof is easy. Applying Formula 2.5 with $a = 0$, and using that $\int_{-h}^{h} f'(0)\,x\,dx = 0$, we find

$$\left| \int_{-h}^{h} f(x)\,dx - 2hf(0) \right| = \left| \int_{-h}^{h} \big(f(x) - f(0) - f'(0)x\big)\,dx \right| \leq \int_{-h}^{h} \frac{M}{2}x^2\,dx = \frac{Mh^3}{3}.$$

Theorem 2.A.16. Suppose that $a < b$ and the Trapezoidal Rule with $n$ subintervals of equal
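length $\Delta x$ is used to approximate $\int_a^b f(x)\,dx$.

If $f''(x)$ exists for all $x$ in some open interval containing the closed interval $[a,b]$ and, if there exists a number $M > 0$ such that, for all $x$ in $[a,b]$, $|f''(x)| \leq M$, then the absolute value of the error, $E_{\text{trap}}$, satisfies the inequality

$$|E_{\text{trap}}| \leq \frac{M(\Delta x)^3 n}{12} = \frac{M(b-a)^3}{12n^2}.$$

Proof. We shall prove this for one subinterval, centered at 0. The general result follows by re-centering the intervals, via substitution, and then adding, which leads to the multiplication by $n$ in the inequality. Let $h = \Delta x/2$.

We will show that

$$\left| \int_{-h}^{h} f(x)\,dx - 2h \cdot \frac{f(h) + f(-h)}{2} \right| \leq \frac{2Mh^3}{3}.$$

We integrate by parts twice.

$$\int_{-h}^{h} f(x)\,dx = h\big(f(h) + f(-h)\big) - \int_{-h}^{h} x f'(x)\,dx = h\big(f(h) + f(-h)\big) - \frac{h^2}{2}\big(f'(h) - f'(-h)\big) + \frac{1}{2}\int_{-h}^{h} x^2 f''(x)\,dx.$$

Therefore, as $f'(h) - f'(-h) = \int_{-h}^{h} f''(x)\,dx$,

$$\int_{-h}^{h} f(x)\,dx - 2h \cdot \frac{f(h) + f(-h)}{2} = -\frac{h^2}{2}\int_{-h}^{h} f''(x)\,dx + \frac{1}{2}\int_{-h}^{h} x^2 f''(x)\,dx = -\frac{1}{2}\int_{-h}^{h} (h^2 - x^2) f''(x)\,dx,$$

and

$$\frac{1}{2}\left| \int_{-h}^{h} (h^2 - x^2) f''(x)\,dx \right| \leq \frac{1}{2}\int_{-h}^{h} (h^2 - x^2) M\,dx = \frac{M}{2} \cdot \frac{4h^3}{3} = \frac{2Mh^3}{3}.$$

Theorem 2.A.17. Suppose that $a < b$, $n$ is even, and Simpson’s Rule with $n$ subintervals of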


equal length $\Delta x$ is used to approximate $\int_a^b f(x)\,dx$.

If $f^{(4)}(x)$ exists for all $x$ in some open interval containing the closed interval $[a,b]$ and, if there exists a number $M > 0$ such that, for all $x$ in $[a,b]$, $|f^{(4)}(x)| \leq M$, then the absolute value of the error, $E_{\text{simp}}$, satisfies the inequality

$$|E_{\text{simp}}| \leq \frac{M(\Delta x)^5 n}{180} = \frac{M(b-a)^5}{180n^4}.$$

Proof. We shall prove this for two subintervals, centered at 0. The general result follows by re-centering the intervals, via substitution, and then adding, which leads to the multiplication by $n/2$ in the inequality. Let $h = \Delta x$.

Let

$$\psi(t) = \int_{-t}^{t} f(x)\,dx - \frac{t}{3}\big(f(-t) + 4f(0) + f(t)\big).$$

We will show that there exists a $c$ such that $-h < c < h$ and

$$\psi(h) = -\frac{h^5 f^{(4)}(c)}{90}.$$

Let

$$\phi(t) = \psi(t) - \left(\frac{t}{h}\right)^5 \psi(h).$$

One easily calculates that $\phi(0) = \phi'(0) = \phi''(0) = 0$ and, for $t \neq 0$,

$$\phi'''(t) = -\frac{2t^2}{3}\left[ \frac{f'''(t) - f'''(-t)}{2t} + \frac{90}{h^5}\,\psi(h) \right].$$

Now, $\phi(h) = 0$, and so, by Rolle’s Theorem, there exists $c_1$ such that $0 < c_1 < h$ and $\phi'(c_1) = 0$. Applying Rolle’s Theorem again, we find that there exists $c_2$ such that $0 < c_2 < c_1 < h$ and $\phi''(c_2) = 0$. Applying Rolle’s Theorem yet again, we find that there exists $c_3$ such that $0 < c_3 < c_2 < c_1 < h$ and

$$\phi'''(c_3) = 0.$$

Applying the Mean Value Theorem, we conclude that there exists $c$ such that $-c_3 < c < c_3$ and

$$f'''(c_3) - f'''(-c_3) = 2c_3 f^{(4)}(c).$$

Combining the previous 3 displayed formulas, we conclude that

$$f^{(4)}(c) + \frac{90}{h^5}\,\psi(h) = 0, \quad \text{i.e.,} \quad \psi(h) = -\frac{h^5 f^{(4)}(c)}{90},$$

which is what we wanted to show.
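To see the three error bounds of Theorems 2.A.15, 2.A.16 and 2.A.17 in action, here is a small sketch for one test case of our own choosing: $f(x) = \sin x$ on $[0, \pi]$ with $n = 10$, where the exact integral is 2 and $M = 1$ bounds both $|f''|$ and $|f^{(4)}|$. The function names are hypothetical helpers, and the example illustrates the bounds but proves nothing.

```python
import math


def midpoint(f, a, b, n):
    dx = (b - a) / n
    return dx * sum(f(a + (i + 0.5) * dx) for i in range(n))


def trapezoid(f, a, b, n):
    dx = (b - a) / n
    interior = sum(f(a + i * dx) for i in range(1, n))
    return dx * (0.5 * f(a) + interior + 0.5 * f(b))


def simpson(f, a, b, n):
    if n % 2:
        raise ValueError("Simpson's Rule needs an even n")
    dx = (b - a) / n
    weighted = sum((4 if i % 2 else 2) * f(a + i * dx) for i in range(1, n))
    return dx / 3 * (f(a) + weighted + f(b))


a, b, n = 0.0, math.pi, 10
exact = 2.0   # integral of sin over [0, pi]
M = 1.0       # |sin''| <= 1 and |sin''''| <= 1

errors = {
    "midpoint": abs(midpoint(math.sin, a, b, n) - exact),
    "trapezoid": abs(trapezoid(math.sin, a, b, n) - exact),
    "simpson": abs(simpson(math.sin, a, b, n) - exact),
}
bounds = {
    "midpoint": M * (b - a) ** 3 / (24 * n ** 2),
    "trapezoid": M * (b - a) ** 3 / (12 * n ** 2),
    "simpson": M * (b - a) ** 5 / (180 * n ** 4),
}
for name in errors:
    print(name, errors[name], "<=", bounds[name])
```

In this test case each actual error falls below the corresponding bound, and Simpson's Rule is far more accurate than the other two for the same $n$.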


Chapter 3 


Applications of Integration 


In this chapter, we will apply our results on anti-differentiation, definite integrals, and the 
Fundamental Theorem of Calculus to a wide variety of problems involving displacement, distance 
traveled, area in the plane, volume, surface areas, mass, centers of mass, rotational inertia, work, 


and hydrostatic pressure. 


Throughout this chapter, we state many of our general results as Propositions, though, in 
many cases, these Propositions could be used as Definitions. For instance, we will see the definite 
integral of speed, with respect to time, yields the total distance traveled. We assume that you 
have a preconceived notion of the distance traveled, and then conclude that that distance can 
be calculated by integrating speed. However, we could, instead, define the distance traveled by 
the integral of the speed. Mathematically, this latter approach is more rigorous; it is, however, 


somewhat intuitively unsatisfying. 


Thus, throughout this chapter, we shall usually assume that we have predefined physical 
terms, such as distance traveled, volume, mass, etc., and state our integration formulas for these 


quantities as Propositions. 


We should remark that many of the “applications” in this chapter are actually “pre-applications”, or what some people refer to as “toy problems”. A pre-application or toy problem is a problem that is stripped of many physical complications or is not inherently of interest in and of itself, but rather is designed mainly to provide some basic example of a fundamental idea that will serve as a building block to tackling more-difficult actual applications.

3.1 Displacement and Distance Traveled in a Straight Line


In this section, we will apply the definite integral and the Fundamental Theorem of Calculus 
to problems involving the net change in position, the displacement, of an object, and the total 
distance traveled by the object. If the object changes its direction, these two quantities will not 
be the same; the total distance traveled would be greater, while the displacement could, in fact, 


turn out to be zero if the object ends up back where it started. 


The second part of the Fundamental Theorem of Calculus, Theorem 2.4.10, tells us that, if $F$ is an anti-derivative of a continuous function $f$ on the interval $[a,b]$, then

$$\int_a^b f(t)\,dt = F(t)\Big|_a^b = F(b) - F(a).$$

This means that, if we start with a function $g(t)$, which is differentiable on an open interval containing $[a,b]$, and $g'(t)$ is continuous, then

$$\int_a^b g'(t)\,dt = \int_a^b \frac{dg}{dt}\,dt = g(b) - g(a).$$


Suppose now that we have an object moving in a straight line, on which we’ve chosen positive and negative directions, i.e., suppose that we have an object moving along a coordinate axis. Let $p(t)$ denote the position (i.e., the coordinate value) of the object at time $t$.

Assuming that $p$ is continuously differentiable, we know that the velocity $v = v(t)$ of the object is the rate of change of the position, with respect to time, $dp/dt$, and the Fundamental Theorem tells us that

$$\int_a^b v(t)\,dt = \int_a^b \frac{dp}{dt}\,dt = p(b) - p(a).$$

The quantity on the right above is the change in the position of the object between times $t = a$ and $t = b$; recall that this is called the displacement of the object between times $a$ and $b$.

Of course, when we refer to the velocity function $v = v(t)$, we are implicitly assuming that the position function is differentiable, for, otherwise, “the velocity” has no good meaning. Thus, we can summarize our discussion above by:

Proposition 3.1.1. If the velocity $v = v(t)$, as a function of time $t$, of an object on a coordinate axis is continuous on the interval $[a,b]$, then the displacement of the object between times $t = a$ and $t = b$ is given by

$$\int_a^b v(t)\,dt.$$


Example 3.1.2. A particle is moving in a straight line in such a way that its velocity $v = v(t)$, in m/s, at time $t$ seconds, is given by

$$v = 3t^2 - 12t + 8.$$

What is the displacement of the particle between times 1 and 3 seconds? Between times 2 and 4 seconds? What is the displacement between times 1 and $t$ seconds, for arbitrary $t$?

Solution:

To answer the first two questions, we need to calculate

$$\int_1^3 (3t^2 - 12t + 8)\,dt \quad \text{and} \quad \int_2^4 (3t^2 - 12t + 8)\,dt.$$

We want to apply the Fundamental Theorem, and so we need to find the (or, an) anti-derivative

$$\int (3t^2 - 12t + 8)\,dt = \frac{3t^3}{3} - \frac{12t^2}{2} + 8t + C = t^3 - 6t^2 + 8t + C = t(t-2)(t-4) + C.$$

Therefore, the displacement between times 1 and 3 seconds is

$$\int_1^3 (3t^2 - 12t + 8)\,dt = t(t-2)(t-4)\Big|_1^3 = -3 - 3 = -6 \text{ meters};$$

this means that the particle ended up, at time 3 seconds, 6 meters in the negative direction from where it started at time 1 second.


Note that we do not know where the particle is at times 1 and 3 seconds, i.e., we don’t know 
the position function p(t). However, except for the arbitrary constant +C above, we know 


p(t), and that’s enough to determine the change in the position. 


The displacement of the particle between times 2 and 4 seconds is

$$\int_2^4 (3t^2 - 12t + 8)\,dt = t(t-2)(t-4)\Big|_2^4 = 0 - 0 = 0 \text{ meters}.$$
2 


This does not mean that the particle did not move between times 2 and 4 seconds; it merely 
means that the particle, at time 4 seconds, ended up at the same place where the particle was 
at time 2 seconds. Of course, this means that the particle had to turn around at some point. 
Do not confuse displacement, the net change in position, with the total distance traveled. 
The displacement here, between times 2 and 4 seconds is 0. We shall calculate the total distance 


traveled by this particle, between times 2 and 4 seconds, in the next example. 


Finally, in this example, we were asked to find the displacement of the particle between times 
1 and t. This is easy, except that it would now be bad form to use t for the integration variable. 
You should use essentially any other variable for the dummy variable of integration; the only 
variables you should avoid are t and v, and maybe d would look confusing too. We'll use z for 
the integration variable. Then, we quickly find that the displacement between times 1 and t 


seconds is 


$$\int_1^t (3z^2 - 12z + 8)\,dz = z(z-2)(z-4)\Big|_1^t = t(t-2)(t-4) - (1)(-1)(-3) = t^3 - 6t^2 + 8t - 3 \text{ meters}.$$
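As a quick sanity check on the three displacements in this example, here is a minimal Python sketch. It assumes only the anti-derivative t(t-2)(t-4) found above; the function names `P` and `displacement` are illustrative choices of ours, not notation from the text.

```python
def P(t):
    """Anti-derivative of v(t) = 3t^2 - 12t + 8, with the constant C taken to be 0."""
    return t * (t - 2) * (t - 4)


def displacement(t_start, t_end):
    """Net change in position between two times, via the Fundamental Theorem."""
    return P(t_end) - P(t_start)


print(displacement(1, 3))  # -6
print(displacement(2, 4))  # 0

# The displacement from time 1 to an arbitrary time t should agree with
# the cubic t^3 - 6t^2 + 8t - 3 at any sample time we try.
for t in (1, 2.5, 5):
    assert abs(displacement(1, t) - (t**3 - 6 * t**2 + 8 * t - 3)) < 1e-12
```

The constant C never matters here, because it cancels in the difference P(t_end) - P(t_start).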


In the example above, we ran into the issue of the distinction between displacement and total
distance traveled. This distinction arises because, if the velocity v of an object is continuous and
always positive on an interval of time [a,b], then the distance traveled by the object between
times a and b is the same as the displacement, namely $\int_a^b v\,dt$. On the other hand, if the
velocity v of an object is continuous and always negative on an interval of time [a,b], then the
displacement will be negative; it will be the negation of the distance traveled by the object
between times a and b, i.e., the distance traveled will be $-\int_a^b v\,dt = \int_a^b -v\,dt$.


Thus, to find the distance traveled, we want to integrate v on intervals where v is positive,
and we want to integrate $-v$ on intervals where v is negative. The way to write this in one
formula is to say that we want to integrate the absolute value |v| in all cases.


Proposition 3.1.3. If the velocity v = v(t), as a function of time t, of an object on a 
coordinate axis is continuous on the interval [a,b], then the (total) distance traveled by 
the object between times t= a and t = b is given by 


$$\int_a^b |v(t)|\,dt.$$


Remark 3.1.4. Proposition 3.1.3 tells us, in a new way, something that we already knew. It
tells us that the total distance traveled, r = r(t), by an object, between some initial time $t_0$ and
some arbitrary time t, is given by

$$r(t) = \int_{t_0}^{t} |v(z)|\,dz,$$


where we once again have used z as a new dummy variable, and we’re assuming that v is 


continuous. 


Now, the first part of the Fundamental Theorem, Theorem 2.4.7, tells us that

$$\frac{dr}{dt} = |v(t)|,$$


i.e., that the (instantaneous) speed of an object can either be defined as the absolute value of the
velocity or as the instantaneous rate of change of the distance traveled, with respect to time.


However, it is important to remember that average speed need not be the absolute value
of the average velocity. For instance, an object that returns to where it started, after some 
amount of time, will have zero displacement and, hence, zero average velocity, but not zero 
average speed. Average speed is the average rate of change of the distance traveled, with 


respect to time. Thus, it is probably best, in the average and/or instantaneous setting, 
to define speed as the rate of change of the distance traveled, with respect to time, and 
then take it as a theorem that the instantaneous speed is equal to the absolute value of the 
instantaneous velocity. 


Let’s look at an example of calculating the (total) distance traveled. 


Example 3.1.5. In Example 3.1.2, we had a particle moving in a straight line in such a way
that its velocity v = v(t), in m/s, at time t seconds, was given by

$$v = 3t^2 - 12t + 8.$$

We found that the displacement of the particle between times 2 and 4 seconds is

$$\int_2^4 (3t^2 - 12t + 8)\,dt = t(t-2)(t-4)\Big|_2^4 = 0 - 0 = 0 \text{ meters}.$$


Now we'll ask a different question. 


What was the distance traveled by the particle between 2 and 4 seconds? 


Solution: 


It’s certainly easy to write the appropriate integral. The distance traveled by the particle 
between 2 and 4 seconds was 
$$\int_2^4 |3t^2 - 12t + 8|\,dt \text{ meters},$$
but how do we calculate the integral of the absolute value? 


One thing that you definitely don’t do is find an anti-derivative of $3t^2 - 12t + 8$, and then
take its absolute value, in hope of producing an anti-derivative of $|3t^2 - 12t + 8|$. This would
be completely wrong.


There’s really only one thing to do: split up the integral into integrals over subintervals on
which $3t^2 - 12t + 8$ is always $\geq 0$ or is always $\leq 0$. Then use that, if $3t^2 - 12t + 8 \geq 0$, then
$|3t^2 - 12t + 8| = 3t^2 - 12t + 8$ and, if $3t^2 - 12t + 8 \leq 0$, then $|3t^2 - 12t + 8| = -(3t^2 - 12t + 8)$.


As $v(t) = 3t^2 - 12t + 8$ is a continuous function, it can switch signs only at points where
it hits 0. We find these points, then check the sign of v(t) in-between the zeroes. Setting
$3t^2 - 12t + 8 = 0$, and using the quadratic formula, we find that the zeroes of v(t) occur where

$$t = \frac{12 \pm \sqrt{144 - 96}}{6} = 2 \pm \frac{2\sqrt{3}}{3}.$$


Obviously, $2 - 2\sqrt{3}/3 < 2$. However, $2 + 2\sqrt{3}/3$ is in the interval [2,4]. Thus, the sign of v(t)
does not switch from positive to negative, or vice-versa, on the interval $[2, 2 + 2\sqrt{3}/3]$ or on
the interval $[2 + 2\sqrt{3}/3, 4]$. As v(2) = -4 and v(4) = 8, we find that $v(t) \leq 0$ on the interval
$[2, 2 + 2\sqrt{3}/3]$, and that $v(t) \geq 0$ on the interval $[2 + 2\sqrt{3}/3, 4]$.


Therefore, the distance traveled, in meters, by the particle between times 2 and 4 seconds 


was 


$$\int_2^4 |3t^2 - 12t + 8|\,dt = \int_2^{2+2\sqrt{3}/3} |3t^2 - 12t + 8|\,dt + \int_{2+2\sqrt{3}/3}^4 |3t^2 - 12t + 8|\,dt =$$

$$\int_2^{2+2\sqrt{3}/3} -(3t^2 - 12t + 8)\,dt + \int_{2+2\sqrt{3}/3}^4 (3t^2 - 12t + 8)\,dt.$$


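Using plus or minus our anti-derivative from Example 3.1.2, we find that the sum above
equals

$$-t(t-2)(t-4)\Big|_2^{2+2\sqrt{3}/3} + t(t-2)(t-4)\Big|_{2+2\sqrt{3}/3}^4 =$$

$$\left[-\left(2 + \tfrac{2\sqrt{3}}{3}\right)\left(\tfrac{2\sqrt{3}}{3}\right)\left(-2 + \tfrac{2\sqrt{3}}{3}\right) - 0\right] + \left[0 - \left(2 + \tfrac{2\sqrt{3}}{3}\right)\left(\tfrac{2\sqrt{3}}{3}\right)\left(-2 + \tfrac{2\sqrt{3}}{3}\right)\right] =$$

$$\frac{16\sqrt{3}}{9} + \frac{16\sqrt{3}}{9} = \frac{32\sqrt{3}}{9} \approx 6.1584.$$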


It is somewhat interesting to note that, since the displacement between times 2 and 4 seconds
was 0, the average velocity between times 2 and 4 seconds was 0, but the average speed was
approximately 6.1584/(4 - 2) = 3.0792 meters per second.


As you can see, the calculation of the distance traveled and/or average speed can be, and 
usually is, significantly more difficult than the calculation of the displacement and/or average 


velocity. 


We should make a final remark in this section. 


Remark 3.1.6. It may seem as though we have competing notions of what average velocity 


and average speed mean. 


Suppose that, at time t, we let p(t) denote the position of an object which is moving along
a line, let r(t) denote the total distance traveled by the object, and let v(t) denote the velocity,


which we will assume is a continuous function of t. 


Then, the definitions of the average velocity and average speed of the object, between times
t = a and t = b (or, on the interval [a,b]), where $a \neq b$, are

$$\text{average velocity} = \frac{p(b) - p(a)}{b - a} \quad\text{and}\quad \text{average speed} = \frac{r(b) - r(a)}{b - a}.$$


However, in Definition 2.3.30, the average value of any Riemann integrable function f(t) on
the interval [a,b] was defined to be

$$\frac{1}{b-a}\int_a^b f(t)\,dt,$$


and we could consider the average value of the (instantaneous) velocity and speed functions, 


v(t) and |v(t)|, respectively. 


Do we need to worry that “average velocity” and “average speed” might refer to the average 
values of the velocity and speed functions, instead of referring to our original definitions? No. 
As we discussed in this section, $\int_a^b v(t)\,dt = p(b) - p(a)$ and $\int_a^b |v(t)|\,dt = r(b) - r(a)$. Thus,
the average value of the velocity function v(t) equals the average velocity from our original 
definition, and the average value of the speed function |v(t)| equals the average speed from our 


original definition.


3.1.1 Exercises


Throughout the exercises, assume that units of length are meters and that units of
time are given in seconds and, hence, that velocities are given in m/s and accelerations
in m/s$^2$.


In Exercises 1 - 10, the velocity function of a particle is given. Find the total
displacement of the particle between the times $t_0$ and $t_1$.


1. v(t) = 8 — 3t7 +1, to =0, t; = 10. 
2. $v(t) = 1/(t - 3) + \cos 4t$, $t_0 = 2\pi$, $t_1 = 4\pi$.
3. $v(t) = \cosh 3t$, $t_0 = -5$, $t_1 = 5$.
4. $v(t) = \sinh 3t$, $t_0 = -5$, $t_1 = 5$.
5. u(t) : inp= 5, =8 

. U — ry = 5 — 7 

a2 —64’ © : 

6. v(t) = t7/(# — 1), to = 3, ty = 6. 


7. 00) Stan’ t, 16 = 0, 7 = 7/4. 


Se ee ee eee 
Uv a i =U, arn i 
Vise . 
ga) a ee 
. v(t) = ——_—__~ = a 
25 +362? ° 4 
10. v(t) = (cost)e"™*, to = 0, t: = 7. oe 


Calculate the total distance traveled by the particle over the given time interval 


with the given velocity function. 
11. v(t) = t? — 5t + 6, [0,5]. 
12. $v(t) = 2t^2 + 12t + 1$, [-4, 4].
13. $v(t) = \sin 2t$, $[0, \pi/2]$.
14. $v(t) = e^t - 1$, [0, 2].


15. v(t) = e—*! + ¢? + cos?t + cosht + 1, [—4, 4]. 
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#? — 9t + 18 
18 Ut) = age? ye: 
6t 


18. v(t) = V4t? + 4¢ + 10, [-3, 5]. 
19. v(t) = In(t/2), [1,4]. 
20. v(t) = sint + cost, [0, 27]. 


In Exercises 21 - 25, calculate (a) the average velocity and (b) the average speed of 


the particle traveling with velocity function v(t) over the given time interval. 


21. $v(t) = \sin 3t$, $[-\pi/6, \pi/3]$.


22. $v(t) = t^2 + 5t - 14$, [0, 12].
_ 6t+8 

 P+t-6' 
24. v(t) = sec? 4t, [—71/10, 1/12]. 


23. v(t) (411, 


25. v(t) = tcosh?t?, [—2, 4]. 


In Exercises 26 - 30, a particle is moving with velocity v(t). Calculate the position
function in terms of the time t from the given initial time $t_0$ where the particle’s
position is $p(t_0)$. Assume that $t > t_0$.


26. $v(t) = t^2 + 5t - 24$, p(0) = 12.
27. $v(t) = t \ln t$, p(1) = 0.

28. $v(t) = \cot^2 t$, $p(\pi/4) = 3$. Assume $t < \pi$.
29. v(t) = >=—— 


30. $v(t) = \sec t \tan t$, p(0) = -3. Assume $t < \pi/2$.


In Exercises 31 - 34, a particle is moving with velocity v(t). Calculate the total
distance D(t) traveled by the particle between the specified starting time $t_0$ and an
arbitrary time t, where $t > t_0$.


31. $v(t) = e^t + \cos^2 t + t^4$, $t_0 = 0$.


32. $v(t) = t^2 + 7t - 18$, $t_0 = 3$.


33. $v(t) = t^2 + 7t - 18$, $t_0 = 1$.


34. $v(t) = \sinh t$, D(0) = 0.


35. Suppose v(t) = (t + 2)(t - 3) and we’re interested in calculating the total distance traveled
by the particle between t = 0 and $t = t_1$, where $t_1 > 0$. Then the integral
$D(t_1) = \int_0^{t_1} |(t + 2)(t - 3)|\,dt$ will need to be evaluated in two pieces because of the sign change at
t = 3. Is the function $D(t_1)$ continuous at $t_1 = 3$? Is it differentiable?


36. Redo the previous problem if $v(t) = (t - 3)^2$. Why is this function easier to work with
when calculating the distance function than the one in the previous problem?


37. Suppose a paratrooper jumps from an airplane with initial velocity v(0) = 0 and accelerates
downward at g = 9.8 m/s$^2$ for three seconds. She opens her parachute and then accelerates
upward at a rate of 2 m/s$^2$ for 5 more seconds.

a. Write an expression for v(t), where “up” is used as the positive direction.

b. What is the paratrooper’s displacement over the eight seconds?

c. What is the paratrooper’s average velocity over the eight seconds?


38. Suppose the A train is traveling at 60 mph. The conductor sees the B train in front of
him on the same track moving in the same direction with speed 30 mph. The conductor
puts on the brakes, causing constant deceleration of a mph, when the trains are exactly 1/4
of a mile apart.

a. How far will the A train travel, in terms of a, before it comes to rest?

b. How far will the B train travel, in terms of a, before the A train comes to rest?

c. What is the minimum a must be to prevent a collision?


39. Suppose v(t) is continuous. Prove, or provide a counterexample to, the statement

$$\int_a^b |v(t)|\,dt = \left|\int_a^b v(t)\,dt\right|.$$


40. Consider the position function

$$p(t) = \begin{cases} t^2 \sin\dfrac{1}{t}, & \text{if } t \neq 0; \\ 0, & \text{if } t = 0. \end{cases}$$

a. Prove that p is differentiable for all t.

b. Prove that p′(t) is not continuous at t = 0.

c. What are the ramifications of (a) and (b) on the application of the fundamental
theorem of calculus? Specifically, is it true that $\int_0^b p'(t)\,dt = p(b) - p(0)$?
In each of Exercises 41 through 43, you are given the acceleration function a(t) for a
particle and the velocity of the particle at time t = 0. Calculate the average velocity
of the particle between times t = 0 and $t = t_1$.


41. $a(t) = 2\sin 3t - 3\cos 2t$, v(0) = 4, $t_1 = \pi/6$.
P 
42. a(t) = a +4. v(0) = 6, t; =9. 
43. a(t) = =< v(0) = 4,  =3. 
25 — ¢? 


44. Calculate the average acceleration of the functions in the three previous problems. 


In some applications, it is reasonable to consider the limit of the displacement or
speed of a moving particle as $t \to \infty$. These limits may or may not exist and should
be evaluated using improper integrals.


45. Suppose a force is acting on a particle in such a way that the velocity of the particle is
$v(t) = e^{-t}$. What is the limit as $t_1 \to \infty$ of the distance the particle has traveled between
times t = 0 and $t = t_1$?
46. Using the velocity function in the previous problem, 


a. What is the average velocity between times t = 0 and $t = t_1$?


b. What is the limit of the average velocity between times t = 0 and $t = t_1$ as $t_1 \to \infty$?


47. Consider a particle traveling in such a way that its velocity function is modeled by a
damped oscillation. Specifically, assume $v(t) = e^{-t} \sin t$.


a. Show that $|v(t)| \to 0$ as $t \to \infty$.

b. Show that $\int_{(k-1)\pi}^{k\pi} |v(t)|\,dt = \frac{1}{2} e^{-k\pi}(1 + e^{\pi})$ for k = 1, 2, 3, ....

c. What is $\int_0^{n\pi} |v(t)|\,dt$, where n is a positive integer? In other words, what’s the total
distance traveled by the particle during this time interval?

d. What is the limit, if it exists, of $\int_0^{n\pi} |v(t)|\,dt$ as $n \to \infty$?


As we shall discuss at length in a later section, the formula for the distance traveled
by an object traveling along a curved path in two or three-dimensional Euclidean
space is a natural generalization of our current formula for distance traveled along
the x-axis. Suppose that $\vec{p}$ is a differentiable curve which gives the position of an
object in three-space. Let $\vec{p}(t) = (x(t), y(t), z(t))$, where $t \in [a,b]$. Then the velocity
vector and instantaneous speed of the particle at $t = t_0$ are, respectively:

$$\vec{v}(t_0) = \vec{p}\,'(t_0) = (x'(t_0), y'(t_0), z'(t_0)) \quad\text{and}\quad |\vec{v}(t_0)| = \sqrt{[x'(t_0)]^2 + [y'(t_0)]^2 + [z'(t_0)]^2}.$$

This notation is convenient because the formula for the total distance traveled
between times $t_0$ and t is exactly the same as the one-dimensional formula:

$$r(t) = \int_{t_0}^{t} |\vec{v}(z)|\,dz.$$


48. Suppose an object is traveling in a circular orbit with radius R. The position function is
$\vec{p}(t) = (R\cos t, R\sin t)$. What is the total distance traveled by the particle between times
$t_1$ and $t_2$? Show that this is equal to the arc length of a sector of a circle with central
angle $t_2 - t_1$.


49. The previous problem confirms our intuition that calculating the distance traveled by a
dynamic particle is equivalent to measuring the length of a path. A DNA molecule is
shaped like a helix, a spiral around a cylinder. The approximate radius of the molecule is
10 angstroms, and the helix rises by about 34 angstroms per revolution.


a. Show that this path can be parameterized as $\vec{p}(t) = (10\cos t, 10\sin t, 17t/\pi)$.


b. Calculate $\vec{p}\,'(t)$. If we think of $\vec{p}$ as the position of a particle traversing the helix, then
$\vec{p}\,'(t)$ is the velocity vector.


c. Calculate |p’(t)|. One can interpret this as either the instantaneous speed of a particle 
traversing the helix, or as the length of the tangent vector. 


d. What is the length, or distance traveled by a particle traversing the helix, in one 


revolution? 


e. A DNA molecule has approximately 285 million turns. What is the approximate
length of a DNA molecule?
3.2 Area in the Plane 


Given a continuous or, at least, integrable function (Riemann integrable, or integrable in the
extended manner defined for improper integrals), we have looked at examples of calculating the
area under the graph and above intervals on the x-axis or, for negative functions, areas above
the graph and under intervals on the x-axis.


In this section, we will look at the more general problem of calculating the area “trapped” 
between the graphs of two different functions. This is a relatively easy application of integration. 
However, as we shall see, the main new difficulty is reminiscent of the difficulty we saw in the 
last section, where we wanted to calculate distance traveled, instead of displacement: we need 


to integrate the absolute value of a function. 


Back in Proposition 2.3.11, we saw that if $f \geq 0$ and integrable on an interval [a,b], then
$\int_a^b f(x)\,dx$ is equal to the area under the graph and above the interval [a,b]. We also saw that
if $f \leq 0$ and integrable on an interval [a,b], then $-\int_a^b f(x)\,dx = \int_a^b -f(x)\,dx$ is equal to the
area above the graph and under the interval [a,b]. One formula that unites these two results is
given in:


Proposition 3.2.1. Suppose that we have a function y = f(x) defined and integrable on an
interval, I, from a to b. Then, the area between the graph of y = f(x) and the x-axis,
for x in the interval I (or, on I), is

$$\int_a^b |f(x)|\,dx.$$


Of course, the absolute value signs in Proposition 3.2.1 give a very succinct way of expressing
the result, but they hide the difficulty: to actually evaluate $\int_a^b |f(x)|\,dx$, we have to do what
we did for distance traveled problems in the previous section, namely, split the integral up into
pieces over various subintervals on which f(x) is $\geq 0$ and on which f(x) is $\leq 0$. We then use
that, if $f(x) \geq 0$, then |f(x)| = f(x) and, if $f(x) \leq 0$, then |f(x)| = -f(x).


Example 3.2.2. Calculate the area between the x-axis and the graph of $y = \ln x$, for $\frac{1}{2} \leq x \leq 2$.
See Figure 3.1.


Figure 3.1: Area between the x-axis and $y = \ln x$, $\frac{1}{2} \leq x \leq 2$.


Solution: 


The area of the region is given by 


$$\int_{1/2}^{2} |\ln x|\,dx.$$


But how do we calculate this? 


We use that $\ln x \leq 0$ if $0 < x \leq 1$, $\ln x \geq 0$ if $x \geq 1$ and, as we saw in Example 1.1.21,
integration by parts tells us that $\int \ln x\,dx = x\ln x - x + C$. (Don’t get confused: the derivative
of $\ln x$ is 1/x, but we’re integrating here.)
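The two facts just stated (the sign of ln x on each side of x = 1, and the anti-derivative x ln x - x) are all that is needed to evaluate the area. The following minimal Python sketch carries out that evaluation; the name `F` is our own label for the anti-derivative, and working the same steps by hand gives (3/2) ln 2 - 1/2.

```python
import math


def F(x):
    """Anti-derivative of ln x obtained from integration by parts (constant taken to be 0)."""
    return x * math.log(x) - x


# On [1/2, 1] we have ln x <= 0, so |ln x| = -ln x there;
# on [1, 2] we have ln x >= 0, so |ln x| = ln x there.
area = -(F(1) - F(0.5)) + (F(2) - F(1))

print(round(area, 4))  # 0.5397
```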


Therefore, 


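$$\int_{1/2}^{2} |\ln x|\,dx = \int_{1/2}^{1} |\ln x|\,dx + \int_{1}^{2} |\ln x|\,dx = \int_{1/2}^{1} -\ln x\,dx + \int_{1}^{2} \ln x\,dx =$$

$$-\left[(0 - 1) - \left(\tfrac{1}{2}\ln\tfrac{1}{2} - \tfrac{1}{2}\right)\right] + \left[(2\ln 2 - 2) - (0 - 1)\right] = \tfrac{1}{2} - \tfrac{1}{2}\ln 2 + 2\ln 2 - 1 = \tfrac{3}{2}\ln 2 - \tfrac{1}{2}.$$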


If we think of the x-axis as the graph of y = g(x) = 0, then finding the area between the
graph of y = f(x) and the x-axis is the same as finding the area between the graphs of y = f(x)
and y = g(x). Our question now is: can we use integration to find the area trapped between
two graphs in the more general case in which we don’t assume that one graph is the x-axis?


Let’s think about this. Consider the functions $y = f(x) = x^2$ and y = g(x) = 4. Suppose we
want to find the area between the graphs of f and g for $1 \leq x \leq 3$.


Figure 3.2: The area between the graphs of $y = x^2$ and y = 4.


What we do is split the problem into two pieces, pieces which correspond to which function
is bigger, f or g. Hence, we find the area between the graphs for $1 \leq x \leq 2$ and add to that the
area between the graphs for $2 \leq x \leq 3$.


How do we find the area between the graphs for $1 \leq x \leq 2$? What we could do is find
the area under the graph of y = 4, namely $\int_1^2 4\,dx$ (which, even without integrating, we know
is $4 \cdot 1$, since the region is a rectangle of height 4 and width 1), and then subtract the missing
area, $\int_1^2 x^2\,dx$, under the graph of $y = x^2$. We would find that the area between the graphs for
$1 \leq x \leq 2$ is

$$\int_1^2 4\,dx - \int_1^2 x^2\,dx = 4 - \frac{7}{3} = \frac{5}{3}.$$


Similarly, we could calculate the area between the graphs of f and g for $2 \leq x \leq 3$ by taking
the area, $\int_2^3 x^2\,dx$, under the graph of $y = x^2$ and above the interval [2,3], and then subtracting
the missing area $\int_2^3 4\,dx$. We would find that the area between the graphs for $2 \leq x \leq 3$ is

$$\int_2^3 x^2\,dx - \int_2^3 4\,dx = \frac{19}{3} - 4 = \frac{7}{3}.$$


However, there’s one psychological reason and one practical reason why, on the different
subintervals, you don’t want to think of taking the entire area under one graph and subtracting
the entire “missing” area under the other graph; you want to think of integrating the differences,
i.e., you want to think of the problem in terms of $\int_1^2 (g(x) - f(x))\,dx$ and $\int_2^3 (f(x) - g(x))\,dx$ in
the first place.


Psychologically, you shouldn’t think of involving any regions other than those that are
trapped between the two graphs. Instead of subtracting areas of entire regions, you should
think of taking the continuous sum of areas of infinitesimal rectangles that lie between the two
graphs, rectangles of infinitesimal width dx, and height (g(x) - f(x)), if $1 \leq x \leq 2$, or height
(f(x) - g(x)), if $2 \leq x \leq 3$. See Figure 3.3. Thus, we have rectangles of infinitesimal area
dA = (g(x) - f(x)) dx or dA = (f(x) - g(x)) dx, and we should add up these infinitesimal areas,
by taking the integral, to find the total area.


In a practical sense, for other functions and intervals, it could be significantly easier to
calculate $\int_a^b (f(x) - g(x))\,dx$ than to separately calculate $\int_a^b f(x)\,dx$ and $\int_a^b g(x)\,dx$ and then
subtract. How is this possible? Nasty terms could cancel out in f(x) - g(x). For instance, if
$f(x) = x^2 + e^{-x^2}$ and $g(x) = 4 + e^{-x^2}$, then $\int_a^b (f(x) - g(x))\,dx = \int_a^b (x^2 - 4)\,dx$, which is easy
to calculate, but you would not succeed in calculating either $\int_a^b f(x)\,dx$ or $\int_a^b g(x)\,dx$.


So, what we have seen is that, over subintervals [a,b] where $g(x) \geq f(x)$, i.e., where
$g(x) - f(x) \geq 0$, we find the area between the graphs of y = f(x) and y = g(x) by calculating
$\int_a^b (g(x) - f(x))\,dx$. Over subintervals [a,b] where $f(x) \geq g(x)$, i.e., where $f(x) - g(x) \geq 0$, we
find the area between the graphs of y = f(x) and y = g(x) by calculating $\int_a^b (f(x) - g(x))\,dx$.


Therefore, regardless of which function is greater, our integrand is always |f(x) - g(x)|.


Figure 3.3: Infinitesimally wide rectangles between the graphs of $y = x^2$ and y = 4.


Proposition 3.2.3. Suppose that we have functions y = f(x) and y = g(x) defined on an
interval, I, from a to b, and that f(x) - g(x) is integrable on I. Then, the area between
the graphs of y = f(x) and y = g(x), for x in the interval I (or, on I), is

$$\int_a^b |f(x) - g(x)|\,dx.$$


Note that, if g(x) = 0, so that the graph of y = g(x) is just the x-axis, then Proposition 3.2.3
reduces to Proposition 3.2.1.


Example 3.2.4. Let’s finish our problem from the discussion above. We have the functions
$y = f(x) = x^2$ and y = g(x) = 4, and we want to find the area between the graphs of f and g
for $1 \leq x \leq 3$.


Solution: 


We need to calculate

$$\int_1^3 |x^2 - 4|\,dx.$$


In order to deal with the absolute value, we need to know where $x^2 - 4 \geq 0$ and where
$x^2 - 4 \leq 0$. As $x^2 - 4$ is continuous, we first find where $x^2 - 4 = 0$. This is easy; it happens
when $x = \pm 2$. Deleting these two zeroes divides the real line into three subintervals $(-\infty, -2)$,
(-2, 2), and $(2, \infty)$, and the sign, $\pm$, of $x^2 - 4$ cannot change on a given one of these subintervals,
for the Intermediate Value Theorem tells us that continuous functions must pass through zero
to switch signs.


Thus, to find the sign of $x^2 - 4$ on each of these subintervals, we may simply pick any x
value in the subinterval, evaluate $x^2 - 4$ there, and see whether it’s positive or negative. In fact,
as we’re integrating from 1 to 3, we don’t actually care about what happens on the interval
$(-\infty, -2)$. From the interval (-2, 2), we’ll pick x = 0, and find then that $x^2 - 4 = -4 < 0$; thus,
$x^2 - 4 < 0$ for all x in the interval (-2, 2), and so $x^2 - 4 \leq 0$ for all x in [-2, 2]. In particular,
$x^2 - 4 \leq 0$ for each x in the interval [1,2]. When x = 3, $x^2 - 4 = 5 > 0$, and so $x^2 - 4 \geq 0$ for
all x in the interval $[2, \infty)$. Of course, it follows that $x^2 - 4 \geq 0$ for all x in the interval [2,3].


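Hence, we find

$$\int_1^3 |x^2 - 4|\,dx = \int_1^2 -(x^2 - 4)\,dx + \int_2^3 (x^2 - 4)\,dx = \left(4x - \frac{x^3}{3}\right)\Big|_1^2 + \left(\frac{x^3}{3} - 4x\right)\Big|_2^3 = \frac{5}{3} + \frac{7}{3} = 4.$$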

Example 3.2.5. Of course, not all problems about area between graphs involve a change in
signs of f(x) - g(x). Consider the problem of finding the area between the graphs of $y = \sin x$
and $y = \cos x$, for $0 \leq x \leq \pi/6$.


All that you need to know is that, for $0 \leq x \leq \pi/6$, $\cos x \geq \sin x$, so that $\cos x - \sin x \geq 0$,
and, hence, $|\cos x - \sin x| = \cos x - \sin x$. Thus, the area between the graphs is

$$\text{Area} = \int_0^{\pi/6} (\cos x - \sin x)\,dx = (\sin x + \cos x)\Big|_0^{\pi/6} = \frac{1}{2} + \frac{\sqrt{3}}{2} - 0 - 1 = \frac{\sqrt{3} - 1}{2}.$$


Figure 3.4: Area between $y = \sin x$ and $y = \cos x$, $0 \leq x \leq \pi/6$.


Really, in a problem where one function is always bigger than (or equal to) the other function, 
you shouldn’t think in terms of absolute values at all; just take the bigger function, subtract the 


smaller function, and integrate. 


There are times when you are not explicitly given the limits of integration. Let’s look at a 
typical such problem. 
Example 3.2.6. Find the area of the bounded region between the graphs of $y = f(x) = x^2$
and $y = g(x) = x^3$.

Solution: If you look at the graphs in Figure 3.5, you can see the region between the graphs
of $y = x^2$ and $y = x^3$ naturally breaks up into three pieces: one piece where $x \leq 0$, one piece
where $0 \leq x \leq 1$, and one piece where $x \geq 1$. But the pieces where $x \leq 0$ and $x \geq 1$ are
unbounded, i.e., go out infinitely far. The bounded region between the graphs is the portion
where $0 \leq x \leq 1$; the part that’s really trapped, or completely bordered, by the graphs.


After we know this, the problem is easy, except for one small issue. You probably normally
think that $x^3$ is bigger than $x^2$. Right? Doesn’t cubing a number give you something bigger than
squaring it? Not for numbers between 0 and 1! (And certainly not for negative numbers,
either.) If $0 < x < 1$, then $x^2 > x^3$. Therefore, for $0 \leq x \leq 1$, $x^2 - x^3 \geq 0$, and we don’t need
to use absolute values to find the area. The area of the bounded region is simply

$$\text{Area} = \int_0^1 (x^2 - x^3)\,dx = \left(\frac{x^3}{3} - \frac{x^4}{4}\right)\Big|_0^1 = \frac{1}{3} - \frac{1}{4} = \frac{1}{12}.$$


Figure 3.5: Area of bounded region between $y = x^2$ and $y = x^3$.
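For polynomial integrands such as the one in Example 3.2.6, the arithmetic can be checked exactly with rational numbers. The sketch below uses Python’s standard `fractions` module; the helper `integrate_poly` is a hypothetical name of ours, and it assumes the polynomial is given as a list of coefficients in increasing powers of x.

```python
from fractions import Fraction


def integrate_poly(coeffs, a, b):
    """Exact integral over [a, b] of the polynomial sum(coeffs[k] * x**k)."""
    def anti(x):
        x = Fraction(x)
        # Term-by-term anti-derivative: c * x^k  ->  c * x^(k+1) / (k+1).
        return sum(Fraction(c) * x ** (k + 1) / (k + 1) for k, c in enumerate(coeffs))
    return anti(b) - anti(a)


# The graphs of y = x^2 and y = x^3 meet where x^2 - x^3 = x^2 (1 - x) = 0, i.e. at x = 0 and x = 1.
# On [0, 1] the integrand x^2 - x^3 is nonnegative, so no absolute value is needed.
area = integrate_poly([0, 0, 1, -1], 0, 1)
print(area)  # 1/12
```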


In the following example, we look at a problem in which nasty pieces of the functions involved 


cancel out. 


Example 3.2.7. Let $f(x) = e^{-x} + e^{x^2}$ and $g(x) = e^{-3x+4} + e^{x^2}$. Find the area between the
graphs of f and g for $0 \leq x \leq 3$.


Solution: 


You might be inclined to graph f and g by hand, or by using a calculator or computer. The
problem is that the $e^{x^2}$ terms get so big so fast that you will have difficulty seeing the relatively
small difference between the values of the functions. The good news is that we don’t need to
see a picture in order to solve the problem.


We need to calculate 


$$\int_0^3 |f(x) - g(x)|\,dx = \int_0^3 \left|e^{-x} - e^{-3x+4}\right|\,dx.$$


To deal with the absolute value, we need to determine where $e^{-x} - e^{-3x+4}$ is positive and
where it’s negative. So, we find where $e^{-x} - e^{-3x+4} = 0$, i.e., where $e^{-x} = e^{-3x+4}$. Taking
natural logs of both sides of the equation (or using that the exponential function is one-to-one),
we find that $-x = -3x + 4$, and so x = 2.


Therefore, since $e^{-x} - e^{-3x+4}$ is continuous and is zero only at x = 2, we must have that
$e^{-x} - e^{-3x+4}$ is either always positive or always negative on each of the intervals $(-\infty, 2)$ and
$(2, \infty)$. We simply check the sign of the function at some point in each interval. When x = 0,
$e^{-x} - e^{-3x+4} = e^0 - e^4 < 0$; hence, $e^{-x} - e^{-3x+4} < 0$ for all x in $(-\infty, 2)$, and so $e^{-x} - e^{-3x+4} \leq 0$
for all x in $(-\infty, 2]$. Similarly, when x = 3, $e^{-x} - e^{-3x+4} = e^{-3} - e^{-5} > 0$, and we conclude
that $e^{-x} - e^{-3x+4} \geq 0$ for all x in $[2, \infty)$.


Thus, we find 


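$$\int_0^3 \left|e^{-x} - e^{-3x+4}\right|\,dx = \int_0^2 \left|e^{-x} - e^{-3x+4}\right|\,dx + \int_2^3 \left|e^{-x} - e^{-3x+4}\right|\,dx =$$

$$\int_0^2 -\left(e^{-x} - e^{-3x+4}\right)\,dx + \int_2^3 \left(e^{-x} - e^{-3x+4}\right)\,dx.$$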


Let’s produce an anti-derivative for all functions of the form $e^{ax+b}$ so that we can use this
result to integrate each piece in the integrals above. Consider $\int e^{ax+b}\,dx$, where a and b are
constants, and $a \neq 0$. Make the substitution u = ax + b, so that du = a dx, i.e., dx = du/a.
Then, we find

$$\int e^{ax+b}\,dx = \int e^{u}\,\frac{du}{a} = \frac{e^{u}}{a} + C = \frac{e^{ax+b}}{a} + C.$$


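Now, we apply this formula to $e^{-x}$ and $e^{-3x+4}$, and obtain

$$\int_0^2 -\left(e^{-x} - e^{-3x+4}\right)\,dx = \left(e^{-x} - \frac{e^{-3x+4}}{3}\right)\Big|_0^2 = \left(e^{-2} - \frac{e^{-2}}{3}\right) - \left(1 - \frac{e^{4}}{3}\right) = \frac{2e^{-2}}{3} + \frac{e^{4}}{3} - 1.$$


We also find that

$$\int_2^3 \left(e^{-x} - e^{-3x+4}\right)\,dx = \left(-e^{-x} + \frac{e^{-3x+4}}{3}\right)\Big|_2^3 = \left(-e^{-3} + \frac{e^{-5}}{3}\right) - \left(-e^{-2} + \frac{e^{-2}}{3}\right) = -e^{-3} + \frac{e^{-5}}{3} + \frac{2e^{-2}}{3}.$$

Putting together all of our above work, we find:

$$\text{Area} = \int_0^3 \left|e^{-x} - e^{-3x+4}\right|\,dx = \frac{2e^{-2}}{3} + \frac{e^{4}}{3} - 1 - e^{-3} + \frac{e^{-5}}{3} + \frac{2e^{-2}}{3} = \frac{e^{4} + 4e^{-2} + e^{-5}}{3} - e^{-3} - 1.$$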
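Because this computation involves several sign flips, a short numerical check is reassuring. The sketch below assumes only the difference $e^{-x} - e^{-3x+4}$ and the anti-derivative of $e^{ax+b}$ obtained by substitution; the names `H`, `area`, and `closed_form` are ours.

```python
import math


def H(x):
    """Anti-derivative of e^(-x) - e^(-3x+4), using the formula e^(ax+b)/a for each term."""
    return -math.exp(-x) + math.exp(-3 * x + 4) / 3


# The difference is <= 0 on [0, 2] and >= 0 on [2, 3], so the first piece is negated.
area = -(H(2) - H(0)) + (H(3) - H(2))

# Collecting the hand-computed terms gives this closed form.
closed_form = (math.exp(4) + 4 * math.exp(-2) + math.exp(-5)) / 3 - math.exp(-3) - 1

print(round(area, 4))         # 17.3323
print(round(closed_form, 4))  # 17.3323
```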


In the next example, we consider the same two functions that we did above, but now we look 
at the area of an unbounded region, a region that extends out infinitely far. This, of course, 


leads to an improper integral. 


Example 3.2.8. Let $f(x) = e^{-x} + e^{x^2}$ and $g(x) = e^{-3x+4} + e^{x^2}$. Find the area between the
graphs of f and g for $0 \leq x < \infty$.


Solution: 


These are the same functions that we used in the previous example, and so we can use much 


of our work from that problem. 


We need to calculate 
$$\int_0^\infty \left|e^{-x} - e^{-3x+4}\right|\,dx.$$


We could do this either one of two ways. 


First, we could proceed exactly as we did in the previous example, and write 


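$$\int_0^\infty \left|e^{-x} - e^{-3x+4}\right|\,dx = \int_0^2 \left|e^{-x} - e^{-3x+4}\right|\,dx + \int_2^\infty \left|e^{-x} - e^{-3x+4}\right|\,dx =$$

$$\int_0^2 -\left(e^{-x} - e^{-3x+4}\right)\,dx + \int_2^\infty \left(e^{-x} - e^{-3x+4}\right)\,dx.$$


We already calculated, in the previous example, the integral on the left; we would still need to
calculate the improper integral on the right.


However, given that we already calculated $\int_0^3 |e^{-x} - e^{-3x+4}|\,dx$ in the previous example, it
would also be reasonable to split the integral up in the following manner:

$$\int_0^\infty \left|e^{-x} - e^{-3x+4}\right|\,dx = \int_0^3 \left|e^{-x} - e^{-3x+4}\right|\,dx + \int_3^\infty \left|e^{-x} - e^{-3x+4}\right|\,dx =$$

$$\frac{e^{4} + 4e^{-2} + e^{-5}}{3} - e^{-3} - 1 + \int_3^\infty \left(e^{-x} - e^{-3x+4}\right)\,dx,$$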


where, again, we still need to calculate the improper integral on the right. 


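We’ll use the second splitting of the integral, though it makes little difference. We find

$$\int_3^\infty \left(e^{-x} - e^{-3x+4}\right)\,dx = \lim_{B\to\infty} \left(-e^{-x} + \frac{e^{-3x+4}}{3}\right)\Big|_3^B = 0 - \left(-e^{-3} + \frac{e^{-5}}{3}\right) = e^{-3} - \frac{e^{-5}}{3}.$$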


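Therefore, our final answer is

$$\text{Area} = \int_0^\infty \left|e^{-x} - e^{-3x+4}\right|\,dx = \frac{e^{4} + 4e^{-2} + e^{-5}}{3} - e^{-3} - 1 + e^{-3} - \frac{e^{-5}}{3} = \frac{e^{4} + 4e^{-2}}{3} - 1.$$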
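One way to build intuition for the improper integral is to watch the area over [0, B] settle down as the upper limit B grows. The sketch below is illustrative only: the names `H`, `area_up_to`, and `limit_value` are ours, and the limiting value $(e^4 + 4e^{-2})/3 - 1$ is what one gets by letting $B \to \infty$ in the anti-derivative, since both exponential terms tend to 0.

```python
import math


def H(x):
    """Anti-derivative of e^(-x) - e^(-3x+4)."""
    return -math.exp(-x) + math.exp(-3 * x + 4) / 3


def area_up_to(B):
    """Area between the graphs for 0 <= x <= B, assuming B >= 2 (sign change at x = 2)."""
    return -(H(2) - H(0)) + (H(B) - H(2))


# H(B) -> 0 as B -> infinity, which leaves H(0) - 2 H(2).
limit_value = (math.exp(4) + 4 * math.exp(-2)) / 3 - 1

for B in (3, 10, 50):
    # The gap to the limiting value shrinks rapidly as B increases.
    print(B, area_up_to(B), limit_value - area_up_to(B))
```

The gap equals -H(B), which behaves like $e^{-B}$ for large B, so very little area lies far out along the x-axis.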


What if you want to find the area between curves that aren’t given to you by having y in
terms of x, but instead are given to you by x being specified in terms of y? For instance, how
do you find the area between the curves/lines $x = 4 - y^2$ and $x = 2 - y$ for $0 \leq y \leq 2$?


Figure 3.6: Area between $x = 4 - y^2$ and $x = 2 - y$, $0 \leq y \leq 2$.

There are several ways to approach this problem, some good, some not so good. Let’s start 
with the not so good. 


• Probably the worst way to approach this problem is to think “I know how to do area problems
only when I have y in terms of x, so I’ll rewrite both equations, solving for y”. You get
$y = \sqrt{4 - x}$ (there’s no plus or minus sign because we’re looking where $y \geq 0$) and $y = 2 - x$.
Understand that in other problems, this step, of solving algebraically for y in terms of x, might
be impossible (in any reasonable way). That’s a big downside to this method in general, but we
can try to proceed anyway with our specific problem.


So, now you need to find the area between the two graphs of $y = \sqrt{4 - x}$ and $y = 2 - x$, for
$0 \leq y \leq 2$. If you look at Figure 3.6, it’s easy to see another difficulty: in terms of x, the region
that we’re trying to find the area of is between $y = 2 - x$ and $y = \sqrt{4 - x}$ when $0 \leq x \leq 2$,
but is between y = 0 and $y = \sqrt{4 - x}$ when $2 \leq x \leq 4$. This means that we need to split the
problem up as the sum of two integrals, i.e., we need to calculate

$$\int_0^2 \left(\sqrt{4 - x} - (2 - x)\right)\,dx + \int_2^4 \left(\sqrt{4 - x} - 0\right)\,dx.$$
0 2 


Yuck. We would do this if we had to, and find the answer of 10/3, but surely there’s an easier 
way. 


• Even if you want to find areas by always integrating with respect to x, you don’t have to
do what we did in the previous method. You could just tell yourself that “flipping” the graph,
interchanging the x- and y-axes, doesn’t change the area of the region. In fact, instead of
actually flipping the graph (to save time, effort, and space) you can simply relabel the old
x-axis as now being the y-axis, and the old y-axis as now being the x-axis, and swap all of the x’s
and y’s in the equations and in the bounds. This way, we are now looking for the area between
the curves/lines $y = 4 - x^2$ and $y = 2 - x$ for $0 \leq x \leq 2$, but the region looks the same as it did
before. Note the new labels on the axes in Figure 3.7.


Figure 3.7: Make the x-axis the vertical axis, and the y-axis the horizontal. 


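Now we know what to do. To calculate the area, we evaluate

$$\int_0^2 \left((4 - x^2) - (2 - x)\right) dx = \int_0^2 (2 + x - x^2)\, dx = \left(2x + \frac{x^2}{2} - \frac{x^3}{3}\right)\Bigg|_0^2 = \frac{10}{3}. \qquad (3.1)$$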


• But the best method for finding the area of the given region is to realize that, while you could
switch the x’s and y’s, as we did above, you don’t need to; just don’t panic about using y for your
integration variable in the first place. Had you not exchanged the x’s and y’s in the first place,
but instead used y for the integration variable, you would have had y’s throughout Formula 3.1
in place of x’s, but of course you’d get the same answer:

$$\int_0^2 \left((4 - y^2) - (2 - y)\right) dy = \int_0^2 (2 + y - y^2)\, dy = \left(2y + \frac{y^2}{2} - \frac{y^3}{3}\right)\Bigg|_0^2 = \frac{10}{3}.$$


Geometrically, with the x-axis and y-axis in their usual positions, this means we are taking
the continuous sum of areas of infinitesimally high rectangles, of height dy, whose length is given
by the difference of the x-coordinates on the two curves, as in Figure 3.8.


Figure 3.8: An infinitesimal rectangle between $x = 4 - y^2$ and $x = 2 - y$.
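As a numerical sanity check on the discussion above, the following Python sketch compares the split x-integral of the first method with the single y-integral of the third method. The helper name `simpson` and the choice of 1000 subintervals are our own illustrative assumptions, not part of the text; both results should come out close to 10/3.

```python
import math

def simpson(f, a, b, n=1000):
    """Composite Simpson's rule on [a, b] with n subintervals (n is made even)."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

# First method: solve for y, integrate with respect to x, split at x = 2.
area_split = (simpson(lambda x: math.sqrt(4 - x) - (2 - x), 0, 2)
              + simpson(lambda x: math.sqrt(4 - x), 2, 4))

# Third method: integrate with respect to y directly.
area_y = simpson(lambda y: (4 - y**2) - (2 - y), 0, 2)

print(area_split, area_y, 10 / 3)
```

The y-integrand is a quadratic, so Simpson's rule reproduces 10/3 up to rounding; the x-version converges more slowly because the square root has an unbounded derivative at x = 4, which is one more practical reason to prefer integrating with respect to y here.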


Even though it’s really nothing new, for the sake of completeness, and for reference, we give 


the general proposition which relates to the discussion above. 


Proposition 3.2.9. Suppose that we have functions x = f(y) and x = g(y) defined on an
interval, I, from a to b, and that |f(y) − g(y)| is integrable on I. Then, the area between
the graphs of x = f(y) and x = g(y), for y in the interval I, is

$$\int_a^b |f(y) - g(y)|\, dy.$$


A particular case of Proposition 3.2.9 is when one of the curves is the graph of x = 0, i.e.,
the y-axis. Then, the integral for the area collapses to just the integral of the absolute value of
the other function.

Example 3.2.10. Suppose, for instance, that you want to find the area between the graph of
$x = \sin y$ and the y-axis, for $-\frac{\pi}{4} \le y \le \frac{\pi}{2}$.


Figure 3.9: Area between $x = \sin y$ and the y-axis, $-\pi/4 \le y \le \pi/2$.


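We need to calculate $\int_{-\pi/4}^{\pi/2} |\sin y|\, dy$. We find

$$\int_{-\pi/4}^{\pi/2} |\sin y|\, dy = \int_{-\pi/4}^{0} |\sin y|\, dy + \int_{0}^{\pi/2} |\sin y|\, dy =$$

$$\int_{-\pi/4}^{0} -\sin y\, dy + \int_{0}^{\pi/2} \sin y\, dy = \cos y\,\Big|_{-\pi/4}^{0} + (-\cos y)\Big|_{0}^{\pi/2} =$$

$$\left(1 - \frac{1}{\sqrt{2}}\right) + 1 = 2 - \frac{1}{\sqrt{2}} = \frac{4 - \sqrt{2}}{2}.$$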
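The value $(4 - \sqrt{2})/2$ can be checked by literally adding up thin rectangles of height dy, in the spirit of Figure 3.8. The sketch below is only an illustration: the function name `midpoint_sum` and the number of rectangles are our assumptions, and a midpoint Riemann sum stands in for the "continuous sum".

```python
import math

def midpoint_sum(f, a, b, n):
    """Midpoint Riemann sum of f on [a, b] using n rectangles of equal height."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

# Length of each thin rectangle is |sin y|, the distance from the y-axis to the curve.
approx = midpoint_sum(lambda y: abs(math.sin(y)), -math.pi / 4, math.pi / 2, 30000)
exact = (4 - math.sqrt(2)) / 2

print(approx, exact)
```

Note that the absolute value is what matters: integrating sin y itself over the same interval would let the part below y = 0 cancel some of the area above it.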


3.2.1 Exercises 


In each of Exercises 1 through 5, find the total area between the x-axis and the
graph of y = f(x), for the indicated values of x. Also, possibly using a calculator or
graphing software, sketch the region whose area you are calculating, and include in
your sketch some typical “infinitesimal” rectangles with “infinitesimal” width dx.


1. $y = \sin x$, $0 \le x \le 2\pi$
2. $y = \cos x$, $0 \le x \le 2\pi$
3. $y = 9 - x^2$, $-4 \le x \le 2$
4. $y = x^3 + x^2 - 2x$, $-3 \le x \le 2$


_y=re®), -1l<a2<l 


In each of Exercises 6 through 10, find the area between the y-axis and the graph of
x = f(y), for the indicated values of y. Also, possibly using a calculator or graphing
software, sketch the region whose area you are calculating, and include in your
sketch some typical “infinitesimal” rectangles with “infinitesimal” height dy.


6. $x = \sin y$, $0 \le y \le 2\pi$
7. $x = \cos y$, $0 \le y \le 2\pi$
8. $x = 9 - y^2$, $-4 \le y \le 2$
9. $x = y^3 + y^2 - 2y$, $-3 \le y \le 2$
10.


x=ye"),-1<y<l 


In each of Exercises 11 through 20, find the area of the bounded region between 


the graphs of the two given functions. Use integration with respect to whichever 


variable seems most convenient. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


y=3-27, y=-6 eo 


y=3-—27, y=3r-1 


r=3-y’, r=3y-1 


YF 2? Y= 
10 7 
oe 14g?’ 2 14+ 2? 
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20. de esis ia 
1+y? I-y 
In each of Exercises 21 through 36, calculate the total area bounded by the graphs
of y = f(x) and y = g(x), or x = f(y) and x = g(y), for values of the independent
variable in the given interval.


21. $f(x) = x^2 - 16$, $g(x) = 9$, $[-10, 10]$.
22. $f(x) = \sin x$, $g(x) = \cos x$, $[0, 2\pi]$.
23. $f(y) = \sin y$, $g(y) = \sin 2y$, $[0, 2\pi]$.
24. f(y) =e” + e841, g(y) = e2¥t? + ev”, [—4, 3]. 
25. f(z) = a+ +23 — 3x? +524+5, g(x) = x4 + 2x — 32? + 5a +6, [—4, 6]. ce 
26. f(x) = In(a#* tan) — In(sinz), g(a) = In(e* cos x), [1,3]. 
27. f(y) = 3/(y + 2), gy) = 2/(y — 4), [-1,3].- 
28. Fy) = pe 90) = tam T/A 
29. f(z) = a+ +27 +2, g(x) = 2° + 32, [-3, 3]. 
30. f(x) =tan-12+27 +43, g(x) = 227 +2+tan-12 — 4, [—3, 5]. 
31. f(y) = e734 +4, g(y) =e— 7 + 2, [-5, 5]. 

412 2 
82. Sv) = a atu) = a lh 
33. $f(x) = \sinh^{-1} x$, $g(x) = \cosh^{-1} x$, $[1, 10]$.

2 

34, f@)= Gye Ey IO = acaesy’| it, 
35. $f(y) = |y|$, $g(y) = -|y| + 1$, $[-1/2, 1/2]$.
36. $f(y) = 2^y$, $g(y) = 3^y$, $[-1, 2]$.


37. Prove that if f, g and h are continuous functions on [a, b] then the area between the graphs 


of f and g is equal to the area between the graphs of f +h and g +h. 


38. Suppose the area enclosed by two continuous functions f and g on the interval [a, b] is
$A = \int_a^b |f(x) - g(x)|\, dx$. Suppose c is an arbitrary real number. Is it true that the area
between the graphs of cf(x) and cg(x) is cA? If not, how can you correct this statement?


39. Calculate the limiting area between the curves $e^{-at}$ and $e^{-bt}$ between t = 0 and t = U
where U → ∞. Assume a and b are positive distinct real numbers.

40. Calculate the area between the curves h(y) = ln ay and g(y) = ln by on the interval [c, d]
where a, b and c are positive and a ≠ b.

41. Calculate the area between the curves a(t) = 1/t and b(t) = 1/t² on the interval [1, U].
Does the limiting area converge as U → ∞?

42. Redo the previous problem with a(t) = 1/t? and b(t) = 1/t?.

43. What is the area enclosed by the curves f(x) = sin nx and g(x) = cos nx on the interval
[0, 2π] where n is a positive integer?


In each of Exercises 44 through 47, use the Midpoint Rule to approximate the areas 


bound by the two curves on the interval [0,1]. Use a partition of n = 4 evenly spaced 


points. 


AA, 


45. 


A6. 


AZ. 


f(z) =e” , g(x) =e”. 
f(a) =e" , g(a) =e", k > 0. 


f(z) =e” , g(x) = e®" , kj m> 0. 


f(x) = V1423, g(x) = 241. 


In Exercises 48 through 51, you are given the area A between the graphs of f(x) 


and g(z) over an interval J. You are also given the two functions with one unknown 


parameter c. Solve for the parameter. 


A8. 


49. 


50. 


51. 


52. 


53. 


A=2A, f(a) = 2a? — 3a +4, g(x) = 2? + cx — 6, I = [0,6]. 
A=-242V/3- 12/6, f(x) =sina, g(x) =c, I= [0,7], c>0. 
A= 16, f(x) =e—|z\, g(x) = |x|, I= [-e,d. 

A=9n- 18, f(z) = Ve — 2, g(x) = c—2, I = [0,c]. 


The equation $h(x) = \sqrt{1 - x^2}$ defines a semi-circle of radius one. Let j(x) = C be the
equation of a horizontal line. Find the value of C such that the line j(x) splits the semi-circle
into two sections with equal areas.

Suppose that f is continuous on [a, b].

a. Prove that |f| is integrable on [a, b].

b. Argue that one can choose k = ±1 such that $k \int_a^b f(x)\, dx \ge 0$.

c. Conclude that $\left|\int_a^b f(x)\, dx\right| \le \int_a^b |f(x)|\, dx$.

d. Use part (c) to show that a lower bound for the area enclosed between the graphs of
f(x) and g(x) on the interval [a, b] is $\int_a^b f(x)\, dx - \int_a^b g(x)\, dx$.


54. Argue that if f and h are continuous on [a, b], then the area enclosed by f and the x-axis
is less than or equal to the sum of the area enclosed by f and h, and the area enclosed by
h and the x-axis.


55. Suppose f and g are continuous on [a, b]. Let h(x) = |f(x)| and j(x) = |g(x)|. Prove that
the area enclosed by j and h on the interval [a, b] is less than or equal to the area enclosed
by f and g on the interval [a, b].


It’s often more convenient to describe a function f using polar coordinates. If we
assume that the distance from the origin to a point on the graph, r, depends only
on the angle between the x-axis and the ray connecting the origin to the point, θ, then
we can write r = f(θ). Under reasonable assumptions, the area of the polar region
defined for θ ∈ [a, b] is $A = \frac{1}{2} \int_a^b f^2(\theta)\, d\theta$. Here, we assume that b − a ≤ 2π to prevent
double counting. Calculate the areas of the polar regions in Exercises 36 - 40.

56. $r = \theta$, $\theta \in [0, \pi]$.

57. $r = \sin 3\theta$, $\theta \in [0, \pi/3]$.

58. $r = \sin n\theta$, $\theta \in [0, \pi/n]$, n a positive integer.

59. $r = r_0$, $\theta \in [0, 2\pi]$. What familiar shape is this?

60. $r = 1 + \sin\theta$, $\theta \in [0, 2\pi]$.


The area between two polar regions defined by the functions r|(@) and r2(0) is given 
by A= i \r1(0) — r2(@)| d@. Calculate the areas between the polar curves over the 


given angular interval in Exercises 41 - 45 


61. $r_1(\theta) = 1/2$, $r_2(\theta) = \cos 3\theta$, $I = [-\pi/9, \pi/9]$.
62. $r_1(\theta) = 3\cos\theta$, $r_2(\theta) = 1 + \cos\theta$, $\theta \in [-\pi/3, \pi/3]$.
63. $r_1 = \sin\theta$, $r_2 = \cos\theta$, $I = [0, \pi/4]$.


64. Recalculate the previous problem by writing the equations of the graphs of the two
functions in Cartesian coordinates.

65. Use the polar area formula to find the area of the annular region enclosed by circles of
radii $R_1$ and $R_2$, $0 < R_1 < R_2$.


Recall that the area below the graph of a probability density function (pdf) is
interpreted as a probability. Specifically, if f(x) is a pdf of a random variable
X, then the probability that X is between a and b is $\int_a^b f(x)\, dx$. In finance and
economics, the area between two curves can be used to assess the materiality in
choosing one model over another. We explore this idea in Exercises 46 - 48.


66. 


67. 


68. 


69. 


Suppose that the XYZ corporation is considering a $200,000 investment. It is known 
with certainty that the most XYZ can lose is its initial investment of $200,000 and the 
most it can profit is $700,000. Analyst A believes the profit is uniformly distributed 
on the interval [—200000, 700000]. Analyst B believes the profit has density function 


x 
A(z) = 117,000, 000° 


a. What is fi(a), the pdf based on Analyst A’s assumptions? 


b. What is the area between fi(2) and f2(x) on the interval [500,700]? What is the 


probabilistic interpretation of this area? 


Under which analyst’s assumptions is it more likely for XYZ to experience a positive 
profit from this investment? 


Calculate the area between the graphs of the functions xf (x) and x f(x) on the interval 
[—200, 700]. What does this area represent? 


Suppose f(x) is a strictly increasing continuous function defined on all reals with the
property that f(0) = 0. Then f possesses an inverse function g. Just as $\int_0^a f(x)\, dx$ is the
area below the graph of f on the interval [0, a], $\int_0^b g(y)\, dy$ is the area to the left of the
graph of g along the (vertical) interval [0, b].

a. Assume a and b are positive. Draw a picture to prove that

$$ab \le \int_0^a f(x)\, dx + \int_0^b g(y)\, dy.$$

This is Young’s Inequality.

b. Based on the picture, formulate a conjecture about when the inequality is an equality.

70. Use the previous problem and the function $f(x) = x^k$, k > 0, to prove that

$$ab \le \frac{a^p}{p} + \frac{b^q}{q}$$

if a, b > 0, 1 < p and $\frac{1}{p} + \frac{1}{q} = 1$.
P 4g 


71. a. Suppose f and g are two continuous functions, both of which are positive for all real
x. Suppose p > 1 is a real number, and q is the unique real such that $\frac{1}{p} + \frac{1}{q} = 1$.
Suppose f and g have the additional properties:

$$\left(\int_a^b [f(x)]^p\, dx\right)^{1/p} = 1 \quad\text{and}\quad \left(\int_a^b [g(x)]^q\, dx\right)^{1/q} = 1.$$

Integrate both sides of the inequality in the previous problem to prove that

$$\int_a^b f(x) g(x)\, dx \le 1.$$

b. Now suppose f and g satisfy all the conditions in part (a) except the last two. Prove
that

$$\int_a^b f(x) g(x)\, dx \le \left(\int_a^b [f(x)]^p\, dx\right)^{1/p} \cdot \left(\int_a^b [g(x)]^q\, dx\right)^{1/q}.$$

This result is known as Hölder’s Inequality.


3.3 Distance Traveled in Space and Arc Length


In Section 3.1, we looked at an object which was moving in a straight line, and found that its
displacement and distance traveled could be calculated by integrating its velocity v(t) and the
speed |v(t)|, respectively.


Suppose, however, that an object is moving along some curved path in a plane or in space. 
How do we use the velocity and/or speed of the object to calculate the displacement and the 
distance traveled? Amazingly, the answer is that, aside from writing things in terms of vectors, 


the formulas look exactly the same. 


It’s also true that the total distance traveled by an object is equal to the length of the curve 
(or line) that it travels along, provided that the object doesn’t move back over points that it’s 
already hit. This means that we can use the same techniques to calculate the arc length of a 
curve that we use in calculating the distance traveled. 


In this section, we will use a small amount of material on vectors and vector-valued functions,
such as the material in Appendix A. While this discussion could be put off until you take
multivariable Calculus, it is really not much more difficult than analyzing motion in a straight line.


Displacement and distance traveled: 


Suppose that an object is moving in xyz-space, $\mathbb{R}^3$, in such a way that, at time t, its x-, y-,
and z-coordinates are given by continuously differentiable functions x(t), y(t), z(t). Then, the
position function or position vector or, simply, position of the object at time t is

$$p(t) = (x(t), y(t), z(t)).$$

When we say that $p : [a, b] \to \mathbb{R}^3$ is continuously differentiable, you may wonder what is
meant at the endpoints a and b of the closed interval, since you can’t take two-sided limits
there. We shall always mean that p is defined and differentiable on an open interval which
contains [a, b], and that p′ is continuous when restricted to the closed interval [a, b].


For a given t value, we sometimes think of p(t) as simply the point (x(t), y(t), z(t)), and, at
other times, think of p(t) as the vector which is represented by the arrow from the origin to the
point p(t); the context should always make it clear how we are thinking of p(t).

If the object is moving in the xy-plane, $\mathbb{R}^2$, we can still think of it as moving in xyz-space,
but always with z-coordinate equal to 0. Thus, we will discuss motion in xyz-space, and that
will also tell you what happens for objects moving in a plane; you take our discussion for motion
in space and either set z(t) = 0 for all t, or just omit the z-coordinate. On the other hand,
diagrams are much easier to produce and understand in the xy-plane; so our diagrams will
typically show the 2-dimensional situation, and we leave it to you to picture the 3-dimensional
case.


The velocity (function or vector) of the object is

$$v(t) = p'(t) = (x'(t), y'(t), z'(t)).$$

While we shall not derive it here, we will mention that, if the velocity vector v(t) is drawn
as initiating from the point p(t), and v(t) ≠ 0, then v(t) will be tangent to the curve that the
object is moving along. See Figure 3.10.


Figure 3.10: Velocity vectors are tangent to the curve defined by p(t). 


As the velocity is the derivative of the position, if we are given v(t), we can anti-differentiate
each component function, and obtain

$$\int v(t)\, dt = \left(x(t) + C_x,\ y(t) + C_y,\ z(t) + C_z\right) = (x(t), y(t), z(t)) + (C_x, C_y, C_z) = p(t) + C,$$

where $C_x$, $C_y$, and $C_z$ are constants.


Now, suppose that we have two times a and b, where a < b. Then, by definition, the definite
integral from a to b of a vector-valued function is obtained by integrating each component
separately; so,

$$\int_a^b v(t)\, dt = \left(\int_a^b x'(t)\, dt,\ \int_a^b y'(t)\, dt,\ \int_a^b z'(t)\, dt\right).$$

By applying the Fundamental Theorem of Calculus to each component, we find that this last
vector of integrals equals

$$\left(x(b) - x(a),\ y(b) - y(a),\ z(b) - z(a)\right) = p(b) - p(a).$$

The vector p(b) — p(a) is the change in the position of the object between times t = a and t = b; 
as in the case of motion in a line, this change in position is the displacement of the object. The 


magnitude of the displacement is the (straight line) distance between the points p(a) and p(b). 




Figure 3.11: A typical displacement vector. 


From our discussion above, we arrive at the more general vector version of Proposition 


Proposition 3.3.1. If the velocity v = v(t), as a function of time t, of an object is
continuous on the interval [a, b], then the displacement of the object between times t = a and
t = b is given by

$$p(b) - p(a) = \int_a^b v(t)\, dt.$$


Example 3.3.2. A particle’s velocity is given by $v(t) = (e^t, \sqrt{2}, -e^{-t})$ meters per second, where
t is in seconds. What is the displacement of the particle between times t = 0 and t = 1 seconds?
What is the magnitude of the displacement?

Solution: 


This is easy. We find the displacement is

$$p(1) - p(0) = \int_0^1 v(t)\, dt = \int_0^1 (e^t, \sqrt{2}, -e^{-t})\, dt = (e^t, \sqrt{2}\,t, e^{-t})\Big|_0^1 =$$

$$(e, \sqrt{2}, e^{-1}) - (1, 0, 1) = (e - 1, \sqrt{2}, e^{-1} - 1) \text{ meters}.$$

The magnitude |p(1) − p(0)| of the displacement vector is

$$|p(1) - p(0)| = |(e, \sqrt{2}, e^{-1}) - (1, 0, 1)| = \sqrt{(e - 1)^2 + (\sqrt{2})^2 + (e^{-1} - 1)^2} =$$

$$\sqrt{e^2 - 2e + 1 + 2 + e^{-2} - 2e^{-1} + 1} = \sqrt{(e - e^{-1})^2 - 2(e + e^{-1} - 3)} \text{ meters},$$

where we have left the answer in this form for a reason that will be apparent in the next example.


If we look at the example above, and anti-differentiate v(t), we find that the position p(t) is
given by

$$p(t) = (e^t, \sqrt{2}\,t, e^{-t}) + C,$$

where C is a constant vector. This means that, between times 0 and 1 second, the particle
moves from the point (1, 0, 1) + C to the point $(e, \sqrt{2}, e^{-1}) + C$, not along a straight line, but
rather along the curved path $p(t) = (e^t, \sqrt{2}\,t, e^{-t}) + C$.

The (straight line) distance between the initial and final points of the particle is the distance
between $(e, \sqrt{2}, e^{-1}) + C$ and (1, 0, 1) + C; this is equal to the magnitude of the displacement
|p(1) − p(0)|.

But, what if we want to know the total distance traveled by the particle along the curved
path given by $p(t) = (e^t, \sqrt{2}\,t, e^{-t}) + C$? This distance traveled along the curved path should
(and will) turn out to be bigger than $\sqrt{(e - e^{-1})^2 - 2(e + e^{-1} - 3)}$.

To determine how to calculate the distance traveled by an object, we first think about
approximating the distance traveled during a small interval of time; then we add together the
resulting small distances, and take the limit as the size of the time interval approaches zero.
Not surprisingly, this results in an integral.


Suppose that we have two times $t_0 < t_1$, and let’s think about what happens when $\Delta t = t_1 - t_0$
is small (close to 0). If Δt is close to 0, then the distance Δs that the object travels (along its
possibly curved path) should approximately equal the distance along a straight line between the
starting position $p(t_0)$ and the ending position $p(t_1)$; this straight line distance is $|p(t_1) - p(t_0)|$.
Thus, we have

$$\Delta s \approx |p(t_1) - p(t_0)|.$$

We can rewrite this distance as

$$\Delta s \approx |p(t_0 + \Delta t) - p(t_0)| = \left|\frac{p(t_0 + \Delta t) - p(t_0)}{\Delta t}\right| \Delta t,$$

where we used that Δt is positive. As Δt → 0, $(p(t_0 + \Delta t) - p(t_0))/\Delta t$ approaches $v(t_0) = p'(t_0)$.

Using differential notation and infinitesimal terminology, this means that the infinitesimal
distance ds traveled by the object in the infinitesimal time interval dt is

$$ds = |v(t)|\, dt.$$


The infinitesimal value ds is frequently referred to as an element of arc length. Of course, to obtain
the total distance traveled, we take a continuous sum of the infinitesimal distances traveled, i.e.,
we take an integral.


Proposition 3.3.3. If the velocity v = v(t), as a function of time t, of an object is
continuous on the interval [a, b], then the distance traveled by the object between times t = a
and t = b is given by

$$\int_{t=a}^{t=b} ds = \int_a^b |v(t)|\, dt,$$

where ds = |v(t)| dt.


Example 3.3.4. Let’s return to the situation in Example 3.3.2. A particle’s velocity is given
by $v(t) = (e^t, \sqrt{2}, -e^{-t})$ meters per second, where t is in seconds. What is the distance traveled
by the particle between times t = 0 and t = 1 seconds?


Solution: 


We calculate the distance traveled:

$$\int_0^1 |v(t)|\, dt = \int_0^1 \sqrt{(e^t)^2 + (\sqrt{2})^2 + (-e^{-t})^2}\, dt = \int_0^1 \sqrt{e^{2t} + 2 + e^{-2t}}\, dt =$$

$$\int_0^1 \sqrt{(e^t + e^{-t})^2}\, dt = \int_0^1 (e^t + e^{-t})\, dt = (e^t - e^{-t})\Big|_0^1 = (e - e^{-1}) - (1 - 1) = e - e^{-1} \text{ meters}.$$

Note that, since $e + e^{-1} - 3 > 0$, the distance traveled is greater than the magnitude of the
displacement,

$$\sqrt{(e - e^{-1})^2 - 2(e + e^{-1} - 3)} \text{ meters},$$

which we found in Example 3.3.2. In other words, the distance traveled along a curved path
between two points is greater than the straight line distance between the points. Good!
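To see the two propositions side by side, the following Python sketch integrates the velocity of Examples 3.3.2 and 3.3.4 component by component (displacement) and integrates its magnitude (distance traveled). The helper names `midpoint_sum`, `velocity`, and `speed`, and the use of a midpoint sum with 20000 pieces, are our own illustrative choices rather than anything prescribed by the text.

```python
import math

def midpoint_sum(f, a, b, n=20000):
    """Midpoint Riemann sum of a real-valued f on [a, b]."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

def velocity(t):
    # v(t) = (e^t, sqrt(2), -e^(-t)), in meters per second
    return (math.exp(t), math.sqrt(2), -math.exp(-t))

def speed(t):
    # |v(t)|, the magnitude of the velocity
    return math.sqrt(sum(c * c for c in velocity(t)))

# Displacement: integrate each component of v separately (Proposition 3.3.1).
displacement = tuple(
    midpoint_sum(lambda t, i=i: velocity(t)[i], 0.0, 1.0) for i in range(3)
)
displacement_magnitude = math.sqrt(sum(c * c for c in displacement))

# Distance traveled: integrate the speed (Proposition 3.3.3).
distance = midpoint_sum(speed, 0.0, 1.0)

print(displacement, displacement_magnitude, distance)
```

The printed distance should agree with $e - e^{-1}$ and exceed the displacement magnitude, matching the comparison made at the end of the example.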


If the velocity v(t) of an object is continuous on the interval [a, b], then we may define the
distance traveled function between times a and t, where a ≤ t ≤ b, to be

$$s(t) = \int_a^t |v(z)|\, dz,$$

where z is just a dummy variable, which we introduce since t is now a limit of integration.
Then, the first part of the Fundamental Theorem of Calculus tells us that

$$\frac{ds}{dt} = |v(t)|.$$


Definition 3.3.5. The speed of an object, whose velocity function is continuous, is the
magnitude of the velocity or, equivalently, the rate of change of the distance traveled, with
respect to time.


Remark 3.3.6. It is possible to weaken the assumptions that we made about our position and
velocity functions, and still calculate the distance traveled via integration.


Assuming that you don’t believe that objects can teleport, then any moving object should
have a continuous position function. If p(t) is continuous, the distance traveled by an object
during times t such that a ≤ t ≤ b is the same as the distance traveled during times t such that
a < t < b.


Thus, to calculate the distance traveled, it is enough for p(t) to be continuous on the closed
interval [a, b], continuously differentiable on the open interval (a, b), and for the integral of
|v(t)| = |p′(t)| on the open interval (a, b), as defined in Definition 2.5.11, to exist. This distance
traveled is thus, again, simply

$$\int_a^b |v(t)|\, dt,$$

provided that this integral exists.


We should mention that, while we assume, for physical reasons (non-teleportation), that 
the position function is continuous, it is fairly common in physics and engineering to allow the 


velocity of an object to be discontinuous. 


For instance, a pitched baseball may be moving at 90 mph in one direction and then, when
it’s hit by a bat, moves in the other direction at around 110 mph. The change in velocity of the
baseball is 200 mph, and for most practical purposes, that change occurs instantly at the time
when the ball is struck by the bat. Thus, we think of the velocity function as being discontinuous
at the time at which the bat strikes the ball.


Of course, what really happens is that the bat strikes the ball, and the ball deforms while
making contact with the bat for a very small amount of time. Hence, the velocity function of the
ball is not truly discontinuous. However, it is so difficult to analyze exactly what happens to the
ball’s velocity (or, more precisely, the velocity of the center of mass of the ball; see Section 3.8)
during the tiny time interval of contact with the bat that the situation is usually idealized and
described as an instantaneous change in velocity of 200 mph.


This discussion does not merely apply to baseballs; it applies often when one object strikes 
another. If you jump into the air 1 foot, and land on the ground, then, just before you strike 
the ground, your speed downward is approximately 8 ft/s. After you hit the ground, your speed 
is zero. Once again, for most practical purposes, it is reasonable to say that you stop instantly. 
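The figure of roughly 8 ft/s can be recovered with a short constant-acceleration computation. This is only a sketch under assumptions that the text does not state explicitly: a constant gravitational acceleration of about g ≈ 32 ft/s², no air resistance, and a peak height of h = 1 ft above the ground.

```latex
v = \sqrt{2gh}
  \approx \sqrt{2 \cdot 32\ \tfrac{\text{ft}}{\text{s}^2} \cdot 1\ \text{ft}}
  = \sqrt{64\ \tfrac{\text{ft}^2}{\text{s}^2}}
  = 8\ \tfrac{\text{ft}}{\text{s}}.
```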


If you think about it, if an object’s velocity changes by some finite amount in zero time, then
the acceleration of the object must be infinite, and so, by Newton’s 2nd Law of Motion, the
force acting on the object must be infinite. Thus, we typically say things like “the bat strikes
the baseball with an instantaneous infinite force” or “the ground exerts an instantaneous infinite
force on you when you land on it after jumping”. Such instantaneous infinite forces are usually
described via the Dirac delta “function”.


What’s really happening in these cases is that an extremely large force is acting in a com- 
plicated manner over an extremely small period of time. We just idealize the situation to the 


instantaneous, discontinuous, setting. 


Example 3.3.7. Suppose that a particle moves in the xy-plane in such a way that its position
p(t), in feet, at time t seconds, where −1 ≤ t ≤ 1, is given by

$$p(t) = \left(t, \sqrt{1 - t^2}\right) = \left(t, (1 - t^2)^{1/2}\right).$$


Describe geometrically the path that the particle takes, and find the distance that the particle 


travels between t = —1 and t = 1 seconds. 


Solution: 


At time t seconds, the x-coordinate of the particle is x = t, and the y-coordinate is $y = \sqrt{1 - t^2}$.
Thus, (x, y) is a point on the path that the particle moves along if and only if −1 ≤ x ≤ 1, y ≥ 0,
and $y = \sqrt{1 - x^2}$. Squaring both sides of this last equation yields $y^2 = 1 - x^2$, or $x^2 + y^2 = 1$.
Therefore, the path of the particle lies on the circle of radius 1 foot, centered at the origin,
but only the part where y ≥ 0. Hence, the particle moves along the top half of the unit circle,
centered at the origin.


We find the velocity

$$v(t) = p'(t) = \left(1, \frac{-t}{\sqrt{1 - t^2}}\right),$$

for −1 < t < 1. Note that the velocity vector does not exist when t = ±1.


In Figure 3.12, we have drawn the semicircle and indicated, in red, some velocity vectors.
Note that the lengths of the velocity vectors do not match the speed; we had to scale the
magnitude in order to fit things in a diagram of reasonable size. What is important for you to
get from the diagram is the direction of the velocity, and that the magnitude of the velocity, the
speed, gets larger as the particle gets closer to the points (−1, 0) and (1, 0), “becoming infinite”
exactly at these endpoints of the semicircle.

Figure 3.12: Velocity vectors have infinite magnitude when t = x = ±1.


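We know from high school geometry how far the particle travels; the length/circumference
of the top half of the unit circle is one half of the circumference of the entire circle: (2π · 1)/2 = π
feet. Let’s make certain that the integral $\int_{-1}^{1} |v(t)|\, dt$ gives us the same answer.


First, we find the speed, in ft/s:

$$|v(t)| = \sqrt{1^2 + \left(\frac{-t}{\sqrt{1 - t^2}}\right)^2} = \sqrt{\frac{1 - t^2}{1 - t^2} + \frac{t^2}{1 - t^2}} = \frac{1}{\sqrt{1 - t^2}}.$$

Thus, the distance traveled by the particle, between times t = −1 and t = 1 second, is given
by

$$\int_{-1}^{1} |v(t)|\, dt = \int_{-1}^{1} \frac{1}{\sqrt{1 - t^2}}\, dt.$$

This is an improper integral (see Section 2.5) with two problem points: t = −1 and t = 1.
Hence, we split the integral, and calculate

$$\int_{-1}^{1} \frac{1}{\sqrt{1 - t^2}}\, dt = \int_{-1}^{0} \frac{1}{\sqrt{1 - t^2}}\, dt + \int_{0}^{1} \frac{1}{\sqrt{1 - t^2}}\, dt =$$

$$\lim_{a \to -1^+} \int_a^0 \frac{1}{\sqrt{1 - t^2}}\, dt + \lim_{b \to 1^-} \int_0^b \frac{1}{\sqrt{1 - t^2}}\, dt =$$

$$\lim_{a \to -1^+} \left(\sin^{-1} t\,\Big|_a^0\right) + \lim_{b \to 1^-} \left(\sin^{-1} t\,\Big|_0^b\right) = \left(0 - \lim_{a \to -1^+} \sin^{-1} a\right) + \left(\lim_{b \to 1^-} \sin^{-1} b - 0\right) =$$

$$-\left(-\frac{\pi}{2}\right) + \frac{\pi}{2} = \pi \text{ feet}.$$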


Whew! It sure took a while to verify that the length of a semicircle of radius 1 is 7. 
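The two ideas used in this example, straight-line approximations Δs ≈ |p(t₁) − p(t₀)| and limits of truncated integrals, can both be watched numerically. The sketch below is illustrative only; the function names `position`, `polyline_length`, and `truncated_integral` are our own, and the truncated integral is evaluated with the antiderivative sin⁻¹ t found above.

```python
import math

def position(t):
    # p(t) = (t, sqrt(1 - t^2)); max() guards against tiny negative rounding errors
    return (t, math.sqrt(max(0.0, 1.0 - t * t)))

def polyline_length(p, a, b, n):
    """Sum of straight-line distances between n + 1 equally spaced times in [a, b]."""
    total = 0.0
    prev = p(a)
    for i in range(1, n + 1):
        cur = p(a + (b - a) * i / n)
        total += math.dist(prev, cur)
        prev = cur
    return total

def truncated_integral(eps):
    """Integral of 1/sqrt(1 - t^2) from -1 + eps to 1 - eps, via arcsine."""
    return math.asin(1 - eps) - math.asin(-1 + eps)

for n in (10, 100, 1000, 10000):
    print(n, polyline_length(position, -1.0, 1.0, n))

for eps in (1e-2, 1e-4, 1e-6):
    print(eps, truncated_integral(eps))

print(math.pi)
```

Both sequences of printed values should creep up toward π from below: the polygonal paths are inscribed in the semicircle, and each truncated integral omits a small piece near each endpoint.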


Arc length: 


The term length in this context is usually augmented and is referred to as arc length, in
order to emphasize that we mean the length along something which curves (i.e., along something
which is composed of “arcs”).


Example 3.3.8. If all we want is to use integration to verify that the arc length of a semicircle
of radius 1 is equal to π, perhaps it would be better to think of a particle which moves around
the semicircle at a different speed from that of the particle in Example 3.3.7.


Suppose that a particle moves in the xy-plane in such a way that its position p(t), in feet,
at time t seconds, where 0 ≤ t ≤ π, is given by

$$p(t) = (-\cos t, \sin t).$$

Then, we see that the x- and y-coordinates of the particle are given by x = −cos t and y = sin t,
so that $x^2 + y^2 = (-\cos t)^2 + \sin^2 t = 1$. Hence, once again, the particle is always on the circle of
radius 1, centered at the origin. We also see that y = sin t ≥ 0 for 0 ≤ t ≤ π, that p(0) = (−1, 0),
and that p(π) = (1, 0). Finally, it is important to note that p(t) is one-to-one, i.e., the particle
is never at the same point at two (or more) different times.


Figure 3.13: Moving around the semicircle with constant speed. 


Thus, as in Example 3.3.7, the particle starts at (−1, 0) and moves clockwise around the
semicircle of radius 1, centered at the origin, ending at (1, 0). However, the speed at which this
particle is moving is very different from the speed of the particle in Example 3.3.7.


We find v(t) = p′(t) = (sin t, cos t), and so the speed is

$$|v(t)| = \sqrt{\sin^2 t + \cos^2 t} = 1 \text{ ft/s}.$$

Hence, we find that our current particle moves with a constant speed of 1 ft/s.


Our integral for the distance that the particle travels, i.e., for the arc length of a semicircle
of radius 1, is now:

$$\int_0^\pi |v(t)|\, dt = \int_0^\pi 1\, dt = t\,\Big|_0^\pi = \pi \text{ feet}.$$

Comparing this example with Example 3.3.7, we see that selecting/imagining a particle that’s
moving in the “right” way can make the calculation of the arc length of a curve much easier.


One of the important things that you should have gotten out of the last two examples is
that, if you want the arc length of a curve, you can think of an object moving along the curve
and calculate the distance traveled by the object, provided that the object does not travel along
any portions of the curve more than once; this means that the position function should be
one-to-one. We also saw that the calculation of the distance traveled can be made much easier by
making a nice choice of a position function.


In fact, it doesn’t matter whether or not we actually think of an object moving along a given
curve. What’s important is that we have a one-to-one, continuously differentiable function p(t)
from a closed interval [a, b], where a < b, into the xy-plane or into xyz-space, that plays the
role of a position function, in that the points on the curve that we’re interested in are precisely
the points that you get from p(t), i.e., the curve whose arc length we want is the range of the
function p. The term simple is frequently used in this context to indicate that p is one-to-one.


We also require, for now, a condition, regularity, which, in terms of motion, would say that 
the object never stops at times in the open interval (a, b); this technical condition is very useful 


mathematically. 


Definition 3.3.9. A simple regular parameterization of a curve in $\mathbb{R}^n$ (e.g., in $\mathbb{R}^2$,
the xy-plane, or in $\mathbb{R}^3$, xyz-space) is a one-to-one, continuously differentiable function p
from a closed interval [a, b], where a < b, into $\mathbb{R}^n$, such that, for all t in [a, b], p′(t) ≠ 0.


The range of a simple regular parameterization, that is, the set of points that p(t) “passes 


through”, is called a simple regular curve. 

We say that a simple regular parameterization p, with domain [a,b], parameterizes the 
simple regular curve which is its range, and that the parameterization starts at the point 
p(a) and ends at the point p(b). 


Remark 3.3.10. We have defined a simple regular curve C as a set of points in $\mathbb{R}^2$ or $\mathbb{R}^3$,
and you should have an intuitive idea of what a curve looks like, but how can we rigorously,
mathematically, say when a set of points matches your intuition for what a curve is?


What we have said in Definition [3.3.9]is that a rigorous definition of a simple regular curve is 
that it is the range of a simple regular parametrization; but keep in mind that, as we saw in the 
previous two examples, a simple regular curve, like a semicircle, can have more than one simple 
regular parameterization. In fact, it is not difficult to show that any simple regular curve has 
an infinite number of different simple regular parameterizations. Nonetheless, as we saw in the 
previous two examples, the curve itself has a length which is independent of the simple regular 
parameterization, i.e., $\int_a^b |p'(t)|\,dt$ will give the same number, regardless of what simple regular


parameterization you use for a given simple regular curve. 

Finally, we should mention that, in many references, a curve is defined to be the parameterization
p, not the range of this function. This is mathematically convenient, but does not agree with
most people's intuition for what a "curve" is, and so we will not use that terminology.


What we have seen in our discussion and examples is that an infinitesimal piece of arc length, 


ds is equal to |p’ (t)| dt, and so we have: 


Proposition 3.3.11. Suppose that C is a simple regular curve in $\mathbb{R}^2$ or in $\mathbb{R}^3$. Then, for
any simple regular parameterization p(t), with domain [a,b], of C, the arc length of C is

$$\int_a^b |p'(t)|\,dt.$$


Curves in the plane: 


In our next examples of arc length calculations, we are going to look at simple regular curves 
in $\mathbb{R}^2$, the xy-plane. In the plane, there are three common ways to specify simple regular


parameterizations. 


One way is to specify appropriate coordinate (or component) functions x = x(t) and y = y(t),
and then the parameterization is, of course, p(t) = (x(t), y(t)). Then, Proposition 3.3.11 tells


us: 


Proposition 3.3.12. If p(t) = (x(t), y(t)), for a ≤ t ≤ b, is a simple regular parameterization
of a curve C in $\mathbb{R}^2$, then

$$\text{arc length of } C = \int_a^b \sqrt{\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2}\,dt.$$
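
As a numerical sanity check on the formula for parameterized plane curves, the integral of the speed can be approximated directly. The following is a minimal Python sketch and not part of the original text; the function name `arc_length`, the choice of the midpoint rule, and the test curve (the upper semicircle of radius 2) are illustrative assumptions.

```python
import math

def arc_length(x_prime, y_prime, a, b, n=10_000):
    """Approximate the integral over [a, b] of sqrt(x'(t)^2 + y'(t)^2)
    using the composite midpoint rule with n equal subintervals."""
    h = (b - a) / n
    total = 0.0
    for i in range(n):
        t = a + (i + 0.5) * h              # midpoint of the i-th subinterval
        total += math.hypot(x_prime(t), y_prime(t))
    return total * h

# Upper semicircle of radius 2: p(t) = (2 cos t, 2 sin t), 0 <= t <= pi.
semicircle = arc_length(lambda t: -2 * math.sin(t),
                        lambda t: 2 * math.cos(t),
                        0.0, math.pi)
print(semicircle, 2 * math.pi)             # the two values should agree closely
```

Because the speed of this particular parameterization is constant, the approximation is essentially exact here; for a general curve the accuracy depends on n.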




However, you are probably most familiar with curves described as the graph of y = f(x),
where a ≤ x ≤ b, and where f is continuously differentiable. There is an obvious parameterization
here: x = t and y = f(t), i.e., p(t) = (t, f(t)). This p is clearly one-to-one, since
$(t_1, f(t_1)) = (t_2, f(t_2))$ immediately implies that $t_1 = t_2$. In addition, using that t = x, we have

$$\sqrt{\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2} = \sqrt{1 + \left(\frac{dy}{dx}\right)^2} \geq 1 > 0.$$


Therefore, if f is continuously differentiable on an open interval containing [a, b], then p(t) =


(t, f(t)) is a simple regular parameterization. 


Proposition 3.3.13. If y = f(x) is continuously differentiable on an open interval containing
[a,b], then

$$ds = \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\,dx$$

and the arc length of the graph of f, for a ≤ x ≤ b, is

$$\int_a^b \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\,dx.$$


While it’s less common, a curve may also be described by specifying x as a function of y, 
instead of giving y as a function of x. Of course, all this does is interchange the roles of x and 


y. 


Proposition 3.3.14. If x = f(y) is continuously differentiable on an open interval containing
[a,b], then

$$ds = \sqrt{1 + \left(\frac{dx}{dy}\right)^2}\,dy$$

and the arc length of the graph of f, for a ≤ y ≤ b, is

$$\int_a^b \sqrt{1 + \left(\frac{dx}{dy}\right)^2}\,dy.$$


Figure 3.14: Infinitesimally, arc length and straight line distance are “equal”. 


All three of the formulas for arc length above are frequently thought of as stemming from 
an infinitesimal version of the Pythagorean Theorem. 


That is, if we perform algebra with the differentials, assuming that dt, dx, and dy are positive,
then what we are integrating in Proposition 3.3.12, Proposition 3.3.13, and Proposition 3.3.14
is always

$$ds = \sqrt{(dx)^2 + (dy)^2},$$

which is frequently written as

$$(ds)^2 = (dx)^2 + (dy)^2.$$


Before we give a couple of examples, we should mention that the anti-derivatives that arise 
in calculating arc length are usually very difficult, or impossible, to find (as elementary func- 
tions). Such definite integrals can be approximated using methods such as Simpson’s Rule 
(Definition 2.6.5). The examples below are very special.
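
To illustrate the remark about approximation, here is a short Python sketch of composite Simpson's Rule applied to an arc length integrand. It is an illustration rather than the text's own procedure; the helper names `simpson` and `graph_arc_length` are assumptions, and the parabola y = x^2 on [0, 1] is chosen only because its arc length happens to have a closed form to compare against.

```python
import math

def simpson(g, a, b, n=1000):
    """Composite Simpson's Rule for the integral of g over [a, b]; n must be even."""
    if n % 2 != 0:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        # interior points alternate between weight 4 (odd i) and weight 2 (even i)
        total += (4 if i % 2 == 1 else 2) * g(a + i * h)
    return total * h / 3

def graph_arc_length(f_prime, a, b, n=1000):
    """Arc length of the graph of y = f(x) on [a, b], given the derivative f'."""
    return simpson(lambda x: math.sqrt(1.0 + f_prime(x) ** 2), a, b, n)

# y = x^2 on [0, 1]: the integrand is sqrt(1 + 4x^2).
parabola_length = graph_arc_length(lambda x: 2 * x, 0.0, 1.0)
exact = math.sqrt(5) / 2 + math.asinh(2) / 4   # closed form, for comparison only
print(parabola_length, exact)
```

The same `graph_arc_length` call works for graphs whose integrand has no elementary anti-derivative, which is the usual situation.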


Example 3.3.15. Find the arc length of the graph of $y = \frac{2}{3}x^{3/2}$, for 0 ≤ x ≤ 8.


Solution: 


We use Proposition 3.3.13 and find

$$\text{arc length} = \int_0^8 \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\,dx = \int_0^8 \sqrt{1 + \left(x^{1/2}\right)^2}\,dx = \int_0^8 \sqrt{1 + x}\,dx.$$

Figure 3.15: The graph of $y = 2x^{3/2}/3$.


We make the substitution u = 1 + x, which tells us that du = dx, and that, as x goes from 0 to
8, u goes from 1 + 0 = 1 to 1 + 8 = 9. Thus, we have that

$$\text{arc length} = \int_1^9 u^{1/2}\,du = \left.\frac{u^{3/2}}{3/2}\right|_1^9 = \frac{2}{3}\left(9^{3/2} - 1^{3/2}\right) = \frac{52}{3}.$$


Example 3.3.16. Find the arc length of the graph of $x = \dfrac{e^y + e^{-y}}{2} = \cosh(y)$, for 0 ≤ y ≤ ln 3.


Figure 3.16: The graph of $x = (e^y + e^{-y})/2$.


Solution: 
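
We use Proposition 3.3.14 and find

$$\text{arc length} = \int_0^{\ln 3}\sqrt{1 + \left(\frac{dx}{dy}\right)^2}\,dy = \int_0^{\ln 3}\sqrt{1 + \left(\frac{e^y - e^{-y}}{2}\right)^2}\,dy = \int_0^{\ln 3}\sqrt{\frac{4 + e^{2y} - 2 + e^{-2y}}{4}}\,dy$$

$$= \int_0^{\ln 3}\sqrt{\left(\frac{e^y + e^{-y}}{2}\right)^2}\,dy = \int_0^{\ln 3}\frac{e^y + e^{-y}}{2}\,dy = \left.\frac{e^y - e^{-y}}{2}\right|_0^{\ln 3} = \frac{1}{2}\left(3 - \frac{1}{3}\right) - 0 = \frac{4}{3}.$$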

If some finite collection of simple regular curves intersect each other in a finite number of 
points, then the arc length of all of the curves combined is simply the sum of the arc lengths of 
the individual curves; this is because the overlap at a finite number of points does not have any 
effect on the total arc length. We call the union of a finite number of simple regular curves, which 


may intersect each other in a finite number of points, a piecewise-simple regular curve. 


We can find arc lengths of some piecewise-simple regular curves by using a parameterization 
p(t) that is not one-to-one, but for which a finite number of points might be repeated for a finite 


number of t values. 


Example 3.3.17. Verify, using integration, that the arc length of a circle of radius r > 0 is $2\pi r$.


Solution: 


Consider the parameterization of the circle of radius r, centered at the origin, given by


p(t) = (r cos t, r sin t) = r(cos t, sin t), for 0 ≤ t ≤ 2π.


This is not a simple regular parameterization, since the function is not one-to-one: p(0) =
p(2π) = (r,0). However, this overlap at (r,0) is the only problem; p(t) is continuously differentiable,
its range is the circle in question, the only "repeated point" occurs when t = 0 and
t = 2π, and

$$|p'(t)| = |r(-\sin t, \cos t)| = r\,|(-\sin t, \cos t)| = r\sqrt{(-\sin t)^2 + (\cos t)^2} = r \neq 0.$$


Thus, we claim that the arc length is given by

$$\int_0^{2\pi} |p'(t)|\,dt = \int_0^{2\pi} r\,dt = r t\,\Big|_0^{2\pi} = 2\pi r,$$


even though the parameterization is not one-to-one. 


Why is this calculation correct? Because we can look at the circle C in question as the union
of the top semicircle $C_1$ (including the endpoints at (−r,0) and (r,0)) and the bottom semicircle
$C_2$ (including the endpoints). Both $C_1$ and $C_2$ are simple regular curves, parameterized by the
restrictions of p to the intervals [0,π] and [π,2π], respectively. Thus, C is a piecewise-simple
regular curve and its arc length is the sum of the arc lengths of $C_1$ and $C_2$, i.e.,

$$\int_0^{\pi} |p'(t)|\,dt + \int_{\pi}^{2\pi} |p'(t)|\,dt = \int_0^{2\pi} |p'(t)|\,dt,$$

which is what we found to equal $2\pi r$.


Parameterizing by arc length: 


Suppose that p: [a,b] → $\mathbb{R}^n$ is a simple regular parameterization of a curve C, which has
length L. Just as we defined the distance traveled function (immediately before Definition 3.3.5),
we can define the arc length function $\ell$: [a,b] → [0, L] by

$$\ell(t) = \int_a^t |p'(u)|\,du;$$

thus, $s = \ell(t)$ gives the arc length along C from the point p(a) to the point p(t).
As before, the first part of the Fundamental Theorem of Calculus tells us that

$$\frac{ds}{dt} = \ell'(t) = |p'(t)|.$$

As p is a regular parameterization, |p'(t)| > 0, and so the Inverse Function Theorem implies that
$\ell$ has a monotonically increasing, differentiable inverse function, i.e., there exists a differentiable
function t = m(s) such that m(0) = a, m(L) = b, and $\ell(m(s)) = s$, for all s in the interval
[0, L], and $m(\ell(t)) = t$, for all t in the interval [a,b]. From the Chain Rule (or the formula for
the derivative of an inverse function), we have that

$$\frac{dt}{ds} = m'(s) = \frac{1}{\ell'(m(s))} = \frac{1}{|p'(m(s))|}.$$


It is easy to check that the composed function q = p ∘ m: [0,L] → $\mathbb{R}^n$ is again a simple
regular parameterization of C, and has the property that q(s) is the unique point on the curve
C which is distance (along C) s away from q(0) = p(a).


Definition 3.3.18. A parameterization such as q, where the parameter measures arc length
along C, is said to be a parameterization by arc length.


Note that if q: [0, L] → $\mathbb{R}^n$ is a simple regular parameterization, parameterized by arc length,
then, applying the Chain Rule (to each component of the vector), we find

$$|q'(s)| = |(p \circ m)'(s)| = |m'(s)\,p'(m(s))| = |m'(s)|\,|p'(m(s))| = \frac{1}{|p'(m(s))|}\cdot|p'(m(s))| = 1.$$


This, of course, gives us what it had better give us: if we use q to define the arc length
function, we get

$$\int_0^s |q'(u)|\,du = \int_0^s 1\,du = u\,\Big|_0^s = s,$$

which, in words, says that if you calculate the arc length between the starting point of a (parameterized)
simple regular curve and the point that's the distance s away (along C) from the
starting point, then what you get is s. If that sounds stupidly obvious, GOOD.


One final remark on parameterizing by arc length: if we return to the situation of an object
moving along a curve, with a simple regular position function p = p(t), where t is time, then
the parameter t is also the arc length (so that p(t) is a parameterization by arc length) if and
only if, for all t, |p'(t)| = 1. Thus, a parameterization by arc length is also referred to as a
parameterization with unit speed, or speed 1.
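
The construction of an arc length parameterization (integrate the speed to get the arc length function, invert it, then compose) can be imitated numerically. The Python sketch below is an illustration under stated assumptions: the curve p(t) = (cos 5t, sin 5t) on [0, π/5] is a hypothetical example, the names `speed`, `ell`, `m`, and `q` simply mirror the roles of |p'|, the arc length function, its inverse, and the composition, and bisection is one convenient way (among others) to invert an increasing function.

```python
import math

A, B = 0.0, math.pi / 5          # domain [a, b] of the example parameterization

def p(t):
    return (math.cos(5 * t), math.sin(5 * t))

def speed(t):
    # |p'(t)| for p(t) = (cos 5t, sin 5t); analytically this is the constant 5
    return math.hypot(-5 * math.sin(5 * t), 5 * math.cos(5 * t))

def ell(t, n=2000):
    """Arc length function: midpoint-rule approximation of the integral of |p'(u)| from A to t."""
    h = (t - A) / n
    return sum(speed(A + (i + 0.5) * h) for i in range(n)) * h

def m(s, tol=1e-12):
    """Inverse of ell on [A, B], found by bisection (ell is increasing because speed > 0)."""
    lo, hi = A, B
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if ell(mid) < s:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def q(s):
    """Arc length parameterization q = p o m."""
    return p(m(s))

print(ell(B))      # total length L of this semicircle, approximately pi
print(q(1.0))      # approximately (cos 1, sin 1): the point 1 unit of arc length from p(A)
```

For this example the inverse is available by hand (m(s) = s/5), so the numerical result can be compared with q(s) = (cos s, sin s).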


More general parameterized curves: 


If we combine parameterizations like those in Example 3.3.7 and in Example 3.3.17, we find
that we can deal with curves that are the ranges of parameterizations which may fail to satisfy
various conditions at a finite number of points. We don't need for our parameterizations to have
closed intervals as their domains. We also don't require that p is continuously differentiable
everywhere, and so p' need not be defined at some points, and even the improper integral (recall
Section 2.5) of |p'(t)| is not automatically guaranteed to exist.


All of these things make the definition below very technical. You need to recall our most
general notion of integrating over fairly general sets of points from Definition 2.5.11. The term
"rectifiable", used below, is a technical term which implies that a reasonable notion of arc length


exists for the curve in question. 


Definition 3.3.19. A rectifiable parameterization of a curve in $\mathbb{R}^n$ is a continuous
function p from an interval I, whose interior is non-empty, into $\mathbb{R}^n$, such that there exists a
finite set of points F (which could be empty) in I such that the restriction of p to the points
of I which are not in F (i.e., to the set I − F) is one-to-one, continuously differentiable, has
an everywhere non-zero derivative, and such that the (possibly) improper integral $\int_I |p'(t)|\,dt$
exists.


The range of a rectifiable parameterization is a rectifiable, piecewise-regular, curve.


Of course, we have: 


Proposition 3.3.20. Suppose that C is a rectifiable, piecewise-regular, curve. Then, for
any rectifiable parameterization p(t), with domain I, of C, the arc length of C is

$$\int_I |p'(t)|\,dt.$$


3.3.1 Exercises


You are given the velocity function v(t) of an object in Exercises 1 - 6. Calculate the
displacement of the object over the given time interval.


1. v(t) = (8 sin 2πt, 3 cos 2πt, 4t), [0, 2].


2. v(t) = ($t^2 + t$, $4t^3$, 0), [0, 3].
3. v(t) = (3 + 4t, 5 − 7t, 11t), [−3, 7].
. (t) 


- (t) = (le), le], Fé), [8,3]. 


(—15,¢+4,sinht), [—5, 8]. 


, 8) = (i lnt, 1/t,e), [1,5]. 


Say whether the given map is a simple regular parameterization of a curve. If it’s 


not, say why it is not. 
7. X(t) = (2t3, 16, cost), t € [-10, 10]. eo 
8. a(t) = (cost, |t + 2|, sint), t € [—7, 0]. 
9. u(t) = (cost, |t + 2|, sint), t € [0,7]. 
10. w(t) = (1,t, t7), t € [-12, 20]. 
11. A(t) = (t3, #3, #3), t © [-5, 5]. eo 
In each of Exercises 12 through 14, give an arc length parameterization of the given
curve.
12. α(t) = (cos 5t, sin 5t), t ∈ [0, 2π/5).

13. w(t) = (3 + 6t, 2 − 5t, 7 + t), t ∈ [0, 4].

14. z(t) = (t°/3, 0, t + 3), t ∈ [1, 3].

15. Suppose φ : [a,b] → $\mathbb{R}^n$ is a regular curve parameterized by arc length. Show that the
length of the curve is b − a.

16. A regular α : [a,b] → $\mathbb{R}^n$ is closed if α(a) = α(b). Note that α must still be one-to-one
everywhere else.

a. If α is a closed regular position function, show that the total displacement over the
interval is 0.
b. Give a simple example that shows that while the displacement is zero, the total
distance traveled may be non-zero.


You are given the position function of a particle. Calculate the total distance 


traveled by the particle over the given time interval. 


17. 


18. 


19. 


p(t) 


p(t) = (sin 3t, cos 3t, 2¢3/7), t € [0, a]. 


B(t) = (t,t, 3), t € [0, 7/4]. 


(t, 12, 0.5¢74), t € [0, 1]. 


Calculate the arc length of the graph of the function over the interval I.


20. 


21. 


225; Ff 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


f(x) =ax+b,a#40, I = [c,d]. 


g(x 


) =In(sinax), I = [7 /4, 7/2]. 
fly) =e”, a> 0, I = [0,1]. 
= 


h(y) = 5Iny, I = [1,5]. 


f(x) 


1 
+ —. J =(1,2). 
~ T= (1 


3 
Generalize the previous problem. Let a > 0 and f(x) = mate: I = [1,2] and calculate 
the arc length of the graph. 
Suppose p(t) = (a cos t, a sin t, b cos t, b sin t), t ∈ [0, 2π], is the position function of a particle
traveling in $\mathbb{R}^4$. What is the total distance traveled?

Parameterize the curve in the previous exercise by arc length.

What is the image of the curve (t®, t?), t € R? Is this curve regular? 

Is the path (t, |t|) regular? Is it rectifiable on the interval [−1, 1]?


The intersection of a sphere centered at the origin with radius R and the plane $z = z_0$,
$-R < z_0 < R$, is a circle. Give a parameterization of this circle and calculate its
circumference in terms of $z_0$ and R.


Suppose you make a journey (on the Earth) consisting of three legs. Assume the Earth is 
a sphere centered at the origin with radius R. You start at the North Pole and travel to a 
point on the equator along a line of longitude. You then walk along the equator through 


90 degrees of longitude. Finally, you travel back along the equator. 

a. Give a piecewise differentiable parameterization for your journey.


b. What is the total distance traveled?


32. In Differential Calculus, we defined the curvature function, κ. If α is a regular parameterization,
then we set $\kappa(t) = \dfrac{|T'(t)|}{|\alpha'(t)|}$, where T(t) is the unit tangent vector. Prove that if
α is parameterized by an arc length parameter s, then $\kappa(s) = \left|\dfrac{dT}{ds}\right|$.


33. Suppose C is the image of some simple regular curve φ : [a,b] → $\mathbb{R}^3$. Suppose ψ : [c,d] →
$\mathbb{R}^3$ is another simple regular parameterization of C. We'd like to make sure that the arc
length of C is the same whether we use φ or ψ.


a. Assume without loss of generality that φ(a) = ψ(c) and φ(b) = ψ(d). Let f : [a,b] →
[c,d] be the function $f = \psi^{-1} \circ \phi$. Let u = f(t) and show that $\dfrac{d\phi}{dt} = \dfrac{d\psi}{du}\,\dfrac{du}{dt}$.


b. Carefully justify the equality:

$$\int_a^b |\phi'(t)|\,dt = \int_c^d |\psi'(u)|\,du.$$


34. Assume the bottom of a tire with radius r is initially positioned at the origin. The bottom 
of the tire is marked with a red piece of tape. As the tire rolls clockwise, the tape traces 
out a cycloid. The figure below shows a cycloid with r = 2. 


a. Let θ be the angle the tape makes with the vertical and let a(θ)
be the vector from the center of the tire to the piece of tape. What is a(θ)?


b. Let b(θ) be the vector from the origin to the center of the tire. What is b(θ)?
c. Using vector addition, determine the position function, p(θ), of the piece of tape.
d. What is the distance traveled by the piece of tape for 0 ≤ θ ≤ 2π?


e. Show that the velocity vector vanishes when θ = 2kπ, k an integer.


Cycloid 




35. What is the arc length of the cycloid between two consecutive cusps? 

36. Suppose that an outer circle orbits clockwise along the outside of an inner circle. This is
a common gear configuration. As in the previous problem, suppose a point on the outer
circle is marked in red. The path traced out by the mark is called an epicycloid. Suppose
the inner and outer circles have radii $r_1$ and $r_2$, respectively, and that the outer circle is
initially positioned on top of the inner circle with the marker touching the inner circle.


a. Let ω be the angle through which the outer circle rotates, and θ the angle between the
y-axis and the vector connecting the centers of the two circles, measured clockwise.
Show that the position vector of the center of the outer circle is given by
$a(\theta) = \left((r_1 + r_2)\sin\theta,\ (r_1 + r_2)\cos\theta\right)$.


b. Show that the vector from the center of the outer circle to the red marker is

$$b(\omega) = \left(-r_2\sin\left(\frac{(r_1 + r_2)\,\omega}{r_1}\right),\ -r_2\cos\left(\frac{(r_1 + r_2)\,\omega}{r_1}\right)\right).$$


c. Justify the equation $r_1\theta = r_2\omega$.


d. Conclude that the position function of the red mark is given by

$$p(\omega) = \left((r_1 + r_2)\sin\left(\frac{r_2\,\omega}{r_1}\right) - r_2\sin\left(\frac{(r_1 + r_2)\,\omega}{r_1}\right),\ (r_1 + r_2)\cos\left(\frac{r_2\,\omega}{r_1}\right) - r_2\cos\left(\frac{(r_1 + r_2)\,\omega}{r_1}\right)\right).$$

Epicycloid


37. Calculate the velocity vector of the epicycloid in the previous problem. 


38. A curve c(t) with domain (−∞, ∞) is periodic if there exists a constant $t_0$ such that
$c(t) = c(t - t_0)$. An epicycloid may fail to be periodic, as suggested by the picture below.
Determine a necessary and sufficient condition for an epicycloid to be periodic. Hint:
consider the relationship between $r_1$ and $r_2$.


Epicycloid II


39. Suppose c(ω) is a parameterization of an epicycloid with infinitely many cusps and that
$r_1 = 1$. Prove that given ε > 0, there exists $\omega_0$ such that $c(\omega_0)$ is a cusp and the arc length
between the points p = (1,0) and $c(\omega_0)$ is less than ε. Note: there's nothing special about
the point (1,0). Given an arbitrary point p on the inner circle, the above argument holds.
We say the cusps are dense on the inner circle.
40. Let c(t) = (t, t sin(1/t)) for t ≠ 0 and let c(0) = 0.


a. Show that c is everywhere continuous. 


b. Show that c is not a rectifiable parameterization. 


41. If a small circle of radius $r_2$ revolves around the inside of a larger circle with radius $r_1$,
the curve defined by the path of a fixed point on the inner circle is a hypocycloid. A
parameterization of a hypocycloid is given below.

$$x(\theta) = (r_1 - r_2)\cos\theta + r_2\cos\left(\frac{(r_1 - r_2)\,\theta}{r_2}\right)$$

$$y(\theta) = (r_1 - r_2)\sin\theta - r_2\sin\left(\frac{(r_1 - r_2)\,\theta}{r_2}\right)$$

Calculate x'(θ) and y'(θ).

42. Show that in a hypocycloid where the ratio of $r_1$ to $r_2$ is 4 to 1, the coordinates satisfy
$x^{2/3} + y^{2/3} = C$, where C is some constant.


43. Students not familiar with the dot product should skip this problem. Suppose
p and q are two distinct points in $\mathbb{R}^3$. We'd like to confirm our intuition that the straight
line segment connecting these two points is the shortest curve among those with endpoints
p and q. More precisely, we'll show that any curve between these two points must be at
least as long as the straight line segment between them.


a. Let c: [a,b] → $\mathbb{R}^3$ be any simple regular curve such that c(a) = p and c(b) = q. Let
v be any unit vector (so, |v| = 1). Prove that $(q - p)\cdot v = \int_a^b c'(t)\cdot v\,dt$.


b. Prove that $\int_a^b c'(t)\cdot v\,dt \leq \int_a^b |c'(t)|\,dt$. Hint: one way to do this is to use the fact
that $x\cdot y = |x|\,|y|\cos\theta$.


c. Let $v = \dfrac{q - p}{|q - p|}$ and conclude that

$$|q - p| \leq \int_a^b |c'(t)|\,dt.$$

In other words, the length of an arbitrary curve from p to q is at least as long as the
straight line segment between p and q.


44. A curve may be parameterized for unbounded time, and yet still have finite length. Let
$p(t) = (ae^{-bt}\cos t,\ ae^{-bt}\sin t)$ where a, b > 0 and t ∈ R. The image of this parameterization
is called the logarithmic spiral.


a. Calculate |p'(t)|.


b. What is $\lim_{t\to\infty}\int_0^t |p'(u)|\,du$?


Logarithmic Spiral


45. It's often more convenient to parameterize a curve in polar coordinates rather than Cartesian
coordinates. Recall that the relationship between the two coordinate systems is given
by the equations:

$$x = r\cos\theta, \qquad y = r\sin\theta.$$

a. Suppose c(θ) = (x(θ), y(θ)) = (r(θ) cos θ, r(θ) sin θ). This means the distance from a
point on the curve to the origin, r, is a function of θ. Calculate dx/dθ and dy/dθ via
the chain and product rules.


b. Show that, for θ between $\theta_0$ and $\theta_1$,

$$\text{arc length} = \int_{\theta_0}^{\theta_1}\sqrt{r^2 + \left(\frac{dr}{d\theta}\right)^2}\,d\theta.$$

Calculate the arc length of the polar parameterized curves in Exercises 46 through
49 over the given interval.
46. r = 1 + sin θ, θ ∈ [0, 2π].
47. r = C, C > 0 a constant, θ ∈ [0, 2π]. What familiar curve is this?


48. r = 3 cos θ, θ ∈ [0, π/4].


49. r = θ, θ ∈ [0, 2π]. This curve is called the Archimedean spiral.


3.4 Area Swept Out and Polar Coordinates


In this section, we discuss how a parameterized curve in the plane leads to a notion of cumulative 
area “swept out” as points move along the curve. This notion is somewhat analogous to “total 
distance traveled”, in that moving back over the same points gets counted each time you hit the 


points, not counted just once and/or not subtracted when moving in the opposite direction. 


We can, however, use the swept out area to calculate the area of a region, provided that we 
make certain that we don’t hit the same points more than once during our sweep or, really, that 


any overlap occurs for a finite number of parameter values. 


We then apply this to the specific case of curves and regions that are described in terms of 


polar coordinates. 


As in the previous section, we will assume that we have a curve, now in the xy-plane,
that is described parametrically; thus, we have a function $p : [t_0, t_1] \to \mathbb{R}^2$, where $[t_0, t_1]$ is
a closed interval in R. This means that we have functions x = x(t) and y = y(t) such that
p(t) = (x(t), y(t)). In our current setting, we do not need to be as restrictive as we were when
discussing arc length. We don't need a simple, or piecewise-simple, regular parameterization; it
is enough for us to assume that x(t) and y(t) extend to continuously differentiable functions on
an open interval containing $[t_0, t_1]$. It is frequently helpful to imagine a particle moving in the
xy-plane and to think that x(t) and y(t) give the x- and y-coordinates of the particle at time t.


We think of p(t) in two different ways: as a point, with coordinates x(t) and y(t), and as
a vector, represented by the arrow from the origin to the point (x(t), y(t)). The context will
always make it clear how we are thinking of p(t). Whether you think of p(t) as the position
vector of a particle or as a purely mathematical point, for each t value in the interval $[t_0, t_1]$,
we wish to consider the line segment from the origin to the point p(t) = (x(t), y(t)); we will
denote this line segment, including the endpoints, by $\ell_{p(t)}$. Note that, if p(t) = 0, then the "line
segment" $\ell_{p(t)}$ is really just a point (the origin).

As t changes, as the "particle" moves, the line segment $\ell_{p(t)}$ "sweeps out" area in the xy-plane;
see Figure 3.17. If we assume that all line segments $\ell_{p(t)}$ intersect each other only at the
origin, then this swept out area would simply be the area of the region which is the union of all
of the $\ell_{p(t)}$.


How do you calculate the area swept out? As you no doubt noticed in Figure 3.17, we labeled
the points on our parameterized curve as (x(t), y(t)) and (x(t + h), y(t + h)). You may have
guessed, correctly, that we are going to estimate the area of the "sector", the area swept out
along the curve, between "times" t and t + h, and then pass to the integral by seeing what
happens as h approaches 0, i.e., as we change t by an infinitesimal amount.




Figure 3.17: Area swept out as particle moves. 


However, if we allow the line segments $\ell_{p(t)}$ to overlap each other in more than the origin,
for more than a finite number of t values, then, generally, the area swept out would be more
than the area of the region which is the union of all of the $\ell_{p(t)}$, for the area swept out counts


the area of a region (always with a plus sign) each time that region is passed over. 


The area of the blue sector in Figure 3.17 is approximated well, for values of h close to 0,
by the area of the triangle with vertices (0,0), (x(t), y(t)) and (x(t + h), y(t + h)). But how do
you calculate the area of a triangle given the coordinates of the vertices? 


Consider a triangle, Δ, with vertices (0,0), (a,b), and (c,d), as in Figure 3.18. The area of
Δ is the area of the triangle with vertices (0,0), (a,0), (a,b), plus the area of the trapezoid
with vertices (a,0), (c,0), (a,b), (c,d), minus the area of the triangle with vertices (0,0), (c,0),
(c,d). Thus, we find

$$\text{area of } \Delta = \frac{1}{2}ab + \frac{1}{2}(b + d)(c - a) - \frac{1}{2}cd = \frac{1}{2}(bc - ad).$$


If we switch the positions of (a,b) and (c,d), we would, of course, obtain that the area of the
triangle is (ad − bc)/2. Hence, a formula that gives us the correct answer, without having to
know the relative positions of (a,b) and (c,d), is

$$\text{area of } \Delta = \frac{1}{2}\,|ad - bc|,$$


and it can be verified (you can do it!) that this formula is correct regardless of where the points 
(a,b) and (c,d) are located, even if they’re in different quadrants. 
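
For readers who would like to spot-check the triangle formula on specific points, here is a small Python sketch. It is illustrative only: the function names are assumptions, and Heron's formula is used purely as an independent comparison, not as part of the text's argument.

```python
import math

def triangle_area(a, b, c, d):
    """Area of the triangle with vertices (0,0), (a,b), (c,d): |ad - bc| / 2."""
    return abs(a * d - b * c) / 2

def heron_area(p, q, r):
    """Independent check: Heron's formula from the three side lengths."""
    s1, s2, s3 = math.dist(p, q), math.dist(q, r), math.dist(r, p)
    s = (s1 + s2 + s3) / 2
    return math.sqrt(s * (s - s1) * (s - s2) * (s - s3))

# A right triangle with legs 3 and 4, and a triangle with vertices in different quadrants.
print(triangle_area(3, 0, 0, 4), heron_area((0, 0), (3, 0), (0, 4)))
print(triangle_area(-1, 2, 3, -1), heron_area((0, 0), (-1, 2), (3, -1)))
```

Swapping the roles of (a,b) and (c,d) changes the sign of ad − bc but not its absolute value, which is why the single formula covers both orderings.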


(c,d) 


Figure 3.18: Finding the area of a triangle, given vertices. 


Returning to the sector in Figure 3.17, we find that its area is approximately equal to the
area of the triangle with vertices (0,0), (x(t), y(t)), (x(t + h), y(t + h)), which we now know is
equal to

$$\text{area of sector} \approx \text{area of } \Delta = \frac{1}{2}\,\big|x(t + h)\,y(t) - y(t + h)\,x(t)\big|.$$


When h is close to 0, the definition of the derivative gives us the approximations x(t + h) ≈
x(t) + h x'(t) and y(t + h) ≈ y(t) + h y'(t). Inserting these into the previous formula, we find
that

$$\text{area of sector} \approx \frac{1}{2}\,\big|(x(t) + h\,x'(t))\,y(t) - (y(t) + h\,y'(t))\,x(t)\big| = \frac{1}{2}\,\big|x'(t)y(t) - y'(t)x(t)\big|\,|h|.$$


In infinitesimal terms, this means that the infinitesimal amount of area, dA, swept out along
the parameterized curve p(t) in an infinitesimal positive amount of time dt is

$$dA = \frac{1}{2}\,\big|x'(t)y(t) - y'(t)x(t)\big|\,dt.$$

As is frequently the case in this chapter, we could take the following proposition as a
definition, but we prefer to use our discussion above as the basis for a proof. 


Proposition 3.4.1. The area swept out by the parameterized curve $p : [t_0, t_1] \to \mathbb{R}^2$,
where p(t) = (x(t), y(t)) is continuously differentiable on an open interval containing $[t_0, t_1]$,
is

$$\int_{t_0}^{t_1} \frac{1}{2}\,\big|x'(t)y(t) - y'(t)x(t)\big|\,dt.$$

This is the area of the region consisting of all of the line segments $\ell_{p(t)}$, for $t_0 \leq t \leq t_1$,
provided that, for all $t_0 < t < t_1$ (note the strict inequalities), the $\ell_{p(t)}$ intersect each other
only at the origin, that is, provided that the only overlap (other than the origin) during the
sweeping is, possibly, at the initial and final t values.


Note that the line segment from the origin to p(t) ≠ 0 overlaps the line segment from the
origin to p(r) ≠ 0, at a point other than the origin, if and only if there exists a real number (a scalar)
λ > 0 such that p(t) = λ p(r), i.e., such that x(t) = λ x(r) and, simultaneously, y(t) = λ y(r).


Example 3.4.2. Let's look at the parameterized curve p(t) given by $x = x(t) = t^2$, $y = y(t) = t^3$,
and −1 ≤ t ≤ 1. We see that $x^3 = t^6 = y^2$, so that the points in the range of this
parameterized curve lie on the graph of $y^2 = x^3$.


Figure 3.19: Area swept out is in green. 


Looking at the curve $y^2 = x^3$ and noting that y(t) increases as t increases, it may be
intuitively clear that there is no overlapping (outside the origin) of line segments out to p(t) at
various t values. To really show this carefully, suppose that $(t^2, t^3)$ and $(r^2, r^3)$ are not equal to
0, i.e., that t ≠ 0 and r ≠ 0, and that there is a λ > 0 such that $t^2 = \lambda r^2$ and $t^3 = \lambda r^3$. We
want to show that either this implies that t = r, so that "two" line segments intersecting outside
the origin implies that the parameter values were the same (and so, the "two" line segments
were actually the same one), or that t and r are the endpoints of the interval; in this example,
those endpoints are −1 and 1.


However, this is easy; just divide the two sides of $t^3 = \lambda r^3$ by the corresponding sides of
$t^2 = \lambda r^2$ (which we can do, since t, r, and λ are not 0), and you immediately obtain that t = r.


Thus, the area swept out by the parameterized curve is equal to the area of the region 
consisting of the union of the line segments out to points on the curve. In Figure 3.19, the swept


out area is in green. 


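The area swept out is

$$\text{area} = \int_{-1}^{1}\frac{1}{2}\,\big|x'(t)y(t) - y'(t)x(t)\big|\,dt = \int_{-1}^{1}\frac{1}{2}\,\big|2t\cdot t^3 - 3t^2\cdot t^2\big|\,dt = \int_{-1}^{1}\frac{1}{2}\,t^4\,dt = \left.\frac{t^5}{10}\right|_{-1}^{1} = \frac{1}{5}.$$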


Example 3.4.3. Let's look at a variant of the previous example. Consider the parameterized
curve p(t) given by $x = x(t) = t^4$, $y = y(t) = t^6$, and −1 ≤ t ≤ 1. We see that $x^3 = t^{12} = y^2$, so
that the points in the range of this parameterized curve still lie on the graph of $y^2 = x^3$. Note,
however, that this time the y-coordinate is always ≥ 0, so that the range of p(t) is only the top
half of the graph of $y^2 = x^3$.


When t = −1, the point/particle is at (1,1), and moves down the curve as t increases to 0,
at which time the point/particle is at the origin. Then, as t increases to 1, the point/particle
moves back up to where it started at (1,1).


It is "obvious" that, as the point/particle moves, it sweeps through the same region twice,
the region in green in Figure 3.20. To show this, without appealing to the picture, we look at
points $(t^4, t^6)$ and $(r^4, r^6)$ that are not equal to 0, such that there is a λ > 0 such that $t^4 = \lambda r^4$
and $t^6 = \lambda r^6$. We want to show that this implies that t = ±r, so that, aside from t = r = 0,
the line segments from the origin out to the curve not only overlap for pairs of values, but that,
at those pairs, the corresponding points on the curve are actually the same.


Figure 3.20: Area swept out is in green.


This is easy; once again, we just "divide the equations" $t^6 = \lambda r^6$ and $t^4 = \lambda r^4$ (which we
can do, since t, r, and λ are not 0), and we obtain that $t^2 = r^2$, and so t = ±r.


Thus, the area swept out by the parameterized curve is equal to twice the area of the green
region in Figure 3.20 which, by symmetry, should be equal to exactly what we calculated in
Example 3.4.2. Let's check.


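The area swept out is

$$\int_{-1}^{1}\frac{1}{2}\,\big|x'(t)y(t) - y'(t)x(t)\big|\,dt = \int_{-1}^{1}\frac{1}{2}\,\big|4t^3\cdot t^6 - 6t^5\cdot t^4\big|\,dt = \int_{-1}^{1}|t|^9\,dt$$

$$= \int_{-1}^{0} -t^9\,dt + \int_0^1 t^9\,dt = \left.-\frac{t^{10}}{10}\right|_{-1}^{0} + \left.\frac{t^{10}}{10}\right|_0^1 = \frac{1}{10} + \frac{1}{10} = \frac{1}{5},$$
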

as we expected. 
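
The two swept-area computations above can also be checked numerically. The sketch below is a hedged illustration, not the text's method: the function name `swept_area` and the use of the midpoint rule are assumptions, and the derivatives are supplied by hand rather than computed.

```python
def swept_area(x, y, dx, dy, t0, t1, n=20_000):
    """Midpoint-rule approximation of the integral over [t0, t1]
    of (1/2) * |x'(t) y(t) - y'(t) x(t)|."""
    h = (t1 - t0) / n
    total = 0.0
    for i in range(n):
        t = t0 + (i + 0.5) * h
        total += 0.5 * abs(dx(t) * y(t) - dy(t) * x(t))
    return total * h

# Example 3.4.2: x = t^2, y = t^3 on [-1, 1].
area_342 = swept_area(lambda t: t**2, lambda t: t**3,
                      lambda t: 2 * t, lambda t: 3 * t**2, -1.0, 1.0)

# Example 3.4.3: x = t^4, y = t^6 on [-1, 1] (the same region swept twice).
area_343 = swept_area(lambda t: t**4, lambda t: t**6,
                      lambda t: 4 * t**3, lambda t: 6 * t**5, -1.0, 1.0)

print(area_342, area_343)   # both should be close to 1/5
```

Note that the absolute value inside the integrand is what makes the second example count the green region twice instead of letting the two passes cancel.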


Area in polar coordinates: 


A classic example of parameterized curves and the area swept out by them is provided by 
curves described using polar coordinates. 


A point with Cartesian coordinates (x, y) is said to have polar coordinates (r, θ) provided
that

$$x = r\cos\theta \quad\text{and}\quad y = r\sin\theta.$$

You should realize immediately that this definition, and the periodicity of sine and cosine,
imply that every point (x,y) possesses an infinite number of polar coordinates! For instance,
the point (x,y) = (1,1) has polar coordinates $(r,\theta) = (\sqrt{2}, \pi/4)$, $(r,\theta) = (-\sqrt{2}, 5\pi/4)$, and,
more generally, polar coordinates

$$(r,\theta) = \left((-1)^n\sqrt{2},\ \frac{\pi}{4} + n\pi\right),$$

where n is any integer. Even worse, the origin, (x,y) = (0,0), has polar coordinates r = 0 and
θ = any real number!


Be aware that some sources require that r ≥ 0 in polar coordinates. While this convention
makes polar coordinates slightly easier to deal with, it destroys our ability to easily describe 


some beautiful curves in terms of polar coordinates. We shall not assume that r must be 
non-negative. 


Given $r$ and $\theta$ for a point $P$, it is easy to produce the Cartesian coordinates of $P$; you simply use that $x = r\cos\theta$ and $y = r\sin\theta$. The question is: given $x$ and $y$, how do you produce one corresponding set of polar coordinates, i.e., given $x$ and $y$, how do you produce a simultaneous solution to $x = r\cos\theta$ and $y = r\sin\theta$?

Squaring and adding yields

$$x^2 + y^2 = r^2\cos^2\theta + r^2\sin^2\theta = r^2.$$

Thus, $r = \pm\sqrt{x^2 + y^2}$.

We see, of course, that $r = 0$ if and only if the point is the origin. If we are considering a point other than the origin, and decide to use a positive $r$, then we must have $r = \sqrt{x^2 + y^2}$, and then a $\theta$, where $0 \le \theta < 2\pi$, is uniquely determined by dividing the equations $x = r\cos\theta$ and $y = r\sin\theta$ by $r$ to obtain:

$$\cos\theta = \frac{x}{\sqrt{x^2+y^2}} \quad\text{and}\quad \sin\theta = \frac{y}{\sqrt{x^2+y^2}}.$$


At this point, you can use inverse trig functions to obtain a $\theta$, as long as you're careful about what quadrant the point is in. Once you have one pair of polar coordinates $(r_0, \theta_0)$ for a point (other than the origin), every other pair that represents the same point is of the form $\left((-1)^n r_0,\ \theta_0 + n\pi\right)$ for integer values of $n$.
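The conversion just described can be sketched in code. This is only an illustration under stated assumptions: the function names `to_polar` and `to_cartesian` are made up for this sketch, and the quadrant bookkeeping is delegated to the standard-library function `math.atan2` rather than done by hand. For the origin, where any angle works, the sketch arbitrarily returns $\theta = 0$.

```python
import math


def to_polar(x, y):
    """Return (r, theta) with r >= 0 and 0 <= theta < 2*pi for the point (x, y)."""
    r = math.hypot(x, y)          # sqrt(x^2 + y^2)
    if r == 0.0:
        return 0.0, 0.0           # origin: every theta works; 0 is an arbitrary choice
    # atan2 handles the quadrant; the modulo shifts (-pi, pi] into [0, 2*pi).
    theta = math.atan2(y, x) % (2 * math.pi)
    return r, theta


def to_cartesian(r, theta):
    """Return (x, y) = (r cos(theta), r sin(theta)); r may be negative."""
    return r * math.cos(theta), r * math.sin(theta)


r0, theta0 = to_polar(1.0, 1.0)   # one pair of polar coordinates for (1, 1)

# Every pair ((-1)^n r0, theta0 + n*pi) should map back to the same point.
equivalent = [((-1) ** n * r0, theta0 + n * math.pi) for n in range(-3, 4)]
for r, theta in equivalent:
    print((r, theta), "->", to_cartesian(r, theta))
```

Each printed Cartesian pair should agree with $(1, 1)$ up to floating-point rounding.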
For points other than the origin, the most standard choice of polar coordinates has $r > 0$ and $0 \le \theta < 2\pi$; with these choices, $r$ is the distance of the point from the origin and $\theta$ is the counter-clockwise angle between the positive x-axis and the line segment from the origin to the point.


Negative values of $\theta$ correspond to line segments which make clockwise angles of $|\theta|$ with the positive x-axis. The relation between a point with polar coordinates $(r,\theta)$ and a point with polar coordinates $(-r, \theta)$ is that they lie on the same line through the origin, the same distance from the origin, but on opposite sides.

Figure 3.21: The polar coordinates $r$ and $\theta$ of the point $(x,y) = (r\cos\theta, r\sin\theta)$.


Polar coordinates are useful for dealing with problems (Calculus and non-Calculus problems) in which data is given in terms of distances and angles, instead of in terms of x- and y-coordinates. But what does any of this have to do with parameterized curves and area swept out???

Well... many cool-looking curves can be described very easily by specifying equations that involve the polar coordinates of the points on the curve. For instance, the equation

$$r = 1 + \cos\theta$$


describes a curve called a cardioid, as shown in Figure The name is derived from the fact 
that the shape is vaguely heart-like.

Figure 3.22: The cardioid given by $r = 1 + \cos\theta$.


We had a computer graph this for us, but you can do it by hand. How? It's not too hard. Note that, since $-1 \le \cos\theta \le 1$, we have that $0 \le 1 + \cos\theta \le 2$, so that $r \ge 0$ and, hence, represents distance from the origin. Now, just think about what happens.

As $\theta$ increases from 0 to $\pi$, that is, as you're drawing and you increase your counter-clockwise angle with the positive x-axis, $\cos\theta$ starts at 1 and decreases to $-1$; hence, the distance that your pen or pencil is from the origin, $r = 1 + \cos\theta$, should start at 2 and decrease to 0 as the angle increases to $\pi$. As the angle $\theta$ increases from $\pi$ to $2\pi$, $r$ steadily increases from 0 back to 2. As $\cos\theta$ is $2\pi$-periodic, you continue to get the same points as $\theta$ increases beyond $2\pi$ or if you take $\theta$ to be negative.


We would like to apply our discussion from earlier in this section to calculate the area of the 
region inside the cardioid, the green region in Figure In fact, we would like to calculate 
the area swept out by curves given in polar coordinates in the form $r = f(\theta)$, for $\theta_0 \le \theta \le \theta_1$, where $f$ is continuously differentiable on an open interval containing the interval $[\theta_0, \theta_1]$.

This is actually easy for us; given $r = f(\theta)$, $x = r\cos\theta$, and $y = r\sin\theta$, we obtain a parameterization of the curve

$$p(\theta) = (x(\theta), y(\theta)) = (f(\theta)\cos\theta,\ f(\theta)\sin\theta).$$
The area swept out by the curve as $\theta$ goes from $\theta_0$ to $\theta_1$, where $\theta_0 < \theta_1$, is thus, by Proposition equal to

$$\frac{1}{2}\int_{\theta_0}^{\theta_1} \left|\,[f(\theta)\cos\theta]'\,f(\theta)\sin\theta - [f(\theta)\sin\theta]'\,f(\theta)\cos\theta\,\right|\,d\theta.$$


We leave it as an exercise for you to verify that this integrand simplifies greatly and yields 


the expression in the proposition below. 
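For readers who want to compare their work on this exercise, one possible route through the simplification is sketched below. It uses only the product rule and $\sin^2\theta + \cos^2\theta = 1$; the shorthand $f = f(\theta)$, $f' = f'(\theta)$, $x = f\cos\theta$, $y = f\sin\theta$ is introduced here for brevity.

```latex
% Shorthand (assumed for this sketch): f = f(\theta), f' = f'(\theta),
% x = f\cos\theta, y = f\sin\theta.
\begin{align*}
x' &= f'\cos\theta - f\sin\theta, \qquad y' = f'\sin\theta + f\cos\theta,\\
x'y - y'x &= (f'\cos\theta - f\sin\theta)\,f\sin\theta - (f'\sin\theta + f\cos\theta)\,f\cos\theta\\
          &= ff'\cos\theta\sin\theta - f^2\sin^2\theta - ff'\sin\theta\cos\theta - f^2\cos^2\theta\\
          &= -f^2\,(\sin^2\theta + \cos^2\theta) = -f^2,\\
\tfrac{1}{2}\,\lvert x'y - y'x\rvert &= \tfrac{1}{2}\,(f(\theta))^2 .
\end{align*}
```

The terms involving $f'$ cancel, which is why the final integrand depends only on $f$ and not on its derivative.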


Proposition 3.4.4. Suppose that $f$ is continuously differentiable on an open interval containing the interval $[\theta_0, \theta_1]$. Then, the area swept out by the parameterized curve given in polar coordinates by $r = f(\theta)$, where $\theta_0 \le \theta \le \theta_1$, is

$$\int_{\theta_0}^{\theta_1} \frac{1}{2}\,r^2\,d\theta = \int_{\theta_0}^{\theta_1} \frac{1}{2}\,(f(\theta))^2\,d\theta.$$


Example 3.4.5. Let's apply Proposition 3.4.4 to calculate the area inside the curve given by $r = 1 + \cos\theta$. Note that, when we discussed drawing this curve, it was clear that there were no overlapping line segments from the origin out to the curve for $0 \le \theta \le 2\pi$ (at points other than the origin). The line segment when $\theta = 0$ and the line segment when $\theta = 2\pi$ are the same, but that's allowed in Proposition 3.4.1 and doesn't affect the area calculation.

Therefore, the area inside the cardioid given by $r = 1 + \cos\theta$ is

$$\frac{1}{2}\int_0^{2\pi} (1 + \cos\theta)^2\,d\theta = \frac{1}{2}\int_0^{2\pi} (1 + 2\cos\theta + \cos^2\theta)\,d\theta.$$


The hard part of this integral is integrating $\cos^2\theta$. You either use Proposition 1.2.5 with $n = 2$, or use the trig identity $\cos^2\theta = (1 + \cos(2\theta))/2$ and an easy substitution of $u = 2\theta$. One way or the other, you need to find that $\int \cos^2\theta\,d\theta = (\theta + \cos\theta\sin\theta)/2 + C$, so that our area integral above becomes

$$\frac{1}{2}\left[\theta + 2\sin\theta + \frac{1}{2}(\theta + \cos\theta\sin\theta)\right]_0^{2\pi} = \frac{1}{2}\,(2\pi + \pi) = \frac{3\pi}{2}.$$
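The value $3\pi/2$ for the area inside the cardioid $r = 1 + \cos\theta$ can be checked numerically. The following is a rough sketch, not part of the text's method: `simpson` and `polar_area` are hypothetical helper names, and Simpson's rule is simply one convenient way to approximate the integral of $\frac{1}{2}(f(\theta))^2$.

```python
import math


def simpson(f, a, b, n=2000):
    """Composite Simpson's rule on [a, b] with an even number n of subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def polar_area(f, theta0, theta1):
    """Approximate the integral of (1/2) f(theta)^2 from theta0 to theta1."""
    return simpson(lambda th: 0.5 * f(th) ** 2, theta0, theta1)


cardioid_area = polar_area(lambda th: 1 + math.cos(th), 0.0, 2 * math.pi)
print(cardioid_area, 3 * math.pi / 2)  # the two numbers should nearly agree
```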
Example 3.4.6. We looked at the cardioid described in polar coordinates by $r = 1 + \cos\theta$. Perhaps surprisingly, the graph of $r = 1 + \cos(3\theta)$ looks dramatically different. This graph is usually referred to as a 3-leaved rose.

Figure 3.23: The 3-leaved rose given by $r = 1 + \cos(3\theta)$.


How could you sketch this by hand, and how do you show that the apparent symmetries under rotations by $2\pi/3$ radians, and under reflection about the x-axis, actually are symmetries?

Once again, $0 \le r \le 2$, and it helps to determine the values of $\theta$ that make $r$ equal 0 and those that make $r$ equal 2.

We have $r = 0$ precisely when $\cos(3\theta) = -1$, which occurs exactly when $3\theta = \pi + 2\pi n$ for some integer $n$. Thus, $r = 0$ when $\theta = \pi/3 + n(2\pi/3)$; for $0 \le \theta < 2\pi$, this occurs when $\theta$ equals $\pi/3$ ($n = 0$), $\pi$ ($n = 1$), and $5\pi/3$ ($n = 2$).

To have $r = 2$, we must have $\cos(3\theta) = 1$, which occurs exactly when $3\theta = 2\pi n$ for some integer $n$. Thus, $r = 2$ when $\theta = n(2\pi/3)$; for $0 \le \theta < 2\pi$, this occurs when $\theta$ equals 0 ($n = 0$), $2\pi/3$ ($n = 1$), and $4\pi/3$ ($n = 2$).


Hence, if you had to draw this 3-leaved rose, you could start at $\theta = 0$ and $r = 2$. As you increase $\theta$ from 0 to $\pi/3$, i.e., as you draw points at an increasing counter-clockwise angle from the positive x-axis, from an angle of 0 to an angle of $\pi/3$ (60°), the value of $r$ drops from 2 down to 0, i.e., you decrease the distance from the point you're drawing to the origin from 2 down to 0. This is what yields the top half of the right-hand leaf.

Now, as $\theta$ increases from $\pi/3$ to $2\pi/3$, you keep increasing the angle at which you're drawing points, and let the distance from the origin increase from 0 back out to 2. This yields the upper right-hand portion of the leaf on the upper-left of the rose. Now continue in this fashion, until you come all the way around to $\theta = 2\pi$, at which point you will have closed the graph, and the points would start repeating if you let $\theta$ get bigger.


How do you verify the apparent symmetries? Notice that

$$r = 1 + \cos(3\theta) = 1 + \cos(3\theta + 2\pi) = 1 + \cos\big(3(\theta + 2\pi/3)\big).$$

This means that a pair $(r_0, \theta_0)$ satisfies $r = 1 + \cos(3\theta)$ if and only if $(r_0, \theta_0 + 2\pi/3)$ satisfies the same equation; in other words, when you rotate the graph by $2\pi/3$, one third of the way around a circle, you get the same graph. Thus, we've verified the rotational symmetry.

Now notice that, because cosine is an even function,

$$r = 1 + \cos(3\theta) = 1 + \cos(-3\theta) = 1 + \cos\big(3(-\theta)\big).$$

This means that a pair $(r_0, \theta_0)$ satisfies $r = 1 + \cos(3\theta)$ if and only if $(r_0, -\theta_0)$ satisfies the same equation. This proves that the graph is symmetric under reflection about the x-axis.


What is the area enclosed by the entire 3-leaved rose? Using the two symmetries that we verified above, it is enough for us to find the area of half of one "leaf", and then multiply by 6. Recall that we saw that the top half of the right-hand leaf is produced as $\theta$ goes from 0 to $\pi/3$. Therefore, we find that the area enclosed is

$$6\cdot\frac{1}{2}\int_0^{\pi/3} (1 + \cos(3\theta))^2\,d\theta = 3\int_0^{\pi/3} \big(1 + 2\cos(3\theta) + \cos^2(3\theta)\big)\,d\theta.$$

We leave it as an exercise for you to show that this yields

$$3\left[\theta + \frac{2\sin(3\theta)}{3} + \frac{3\theta + \cos(3\theta)\sin(3\theta)}{6}\right]_0^{\pi/3} = \frac{3\pi}{2}.$$
We should make one final remark here. We have used the term "3-leaved rose" as a generic, qualitative description of the graph of $r = 1 + \cos(3\theta)$. A very similar-looking 3-leaved rose, a trifolium, has an equation $r = 2\cos(3\theta)$; this is a more-standard "3-leaved rose". We compare the two graphs in Figure 3.24, where $r = 2\cos(3\theta)$ describes the inside rose (in black).

Figure 3.24: $r = 1 + \cos(3\theta)$ in blue. $r = 2\cos(3\theta)$ in black.

We leave it to you as an exercise to show that the area inside this trifolium is $\pi$, which is significantly less than $3\pi/2$, which agrees with what we see in Figure
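Both rose areas can be checked numerically. The sketch below is illustrative only: `simpson` and `polar_area` are hypothetical helpers, and it relies on the standard fact (an assumption as far as this text is concerned) that the trifolium $r = 2\cos(3\theta)$ is traced exactly once as $\theta$ goes from 0 to $\pi$.

```python
import math


def simpson(f, a, b, n=2000):
    """Composite Simpson's rule on [a, b] with an even number n of subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def polar_area(f, theta0, theta1):
    """Approximate the integral of (1/2) f(theta)^2 from theta0 to theta1."""
    return simpson(lambda th: 0.5 * f(th) ** 2, theta0, theta1)


def rose(th):
    return 1 + math.cos(3 * th)


def trifolium(th):
    return 2 * math.cos(3 * th)


rose_full = polar_area(rose, 0.0, 2 * math.pi)        # whole curve at once
rose_by_symmetry = 6 * polar_area(rose, 0.0, math.pi / 3)  # six half-leaves
trifolium_area = polar_area(trifolium, 0.0, math.pi)  # traced once on [0, pi]

print(rose_full, rose_by_symmetry, 3 * math.pi / 2)
print(trifolium_area, math.pi)
```

The first line should show three nearly equal numbers, which supports the symmetry argument; the second should show the trifolium's area close to $\pi$.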


Example 3.4.7. As we saw earlier, $r = 1 + \cos\theta$ describes a cardioid. You might think that

$$r = \cos\theta$$

would also describe some interesting polar curve, a curve that you never saw in terms of Cartesian coordinates. However, you'd be wrong if you thought this.


If you multiply the equation by $r$, you don't affect the set of points described by the equation; for multiplying by $r$ adds the solution $r = 0$, but this just describes the origin, which is already a point on the curve, since $0 = \cos(\pi/2)$. Thus, the curve may be described by

$$r^2 = r\cos\theta.$$

But this is easy to change into Cartesian coordinates; the equation becomes

$$x^2 + y^2 = x.$$

Subtracting $x$ from each side, and completing the square yields

$$x^2 - x + y^2 = 0 \quad\text{and so}\quad \left(x - \frac{1}{2}\right)^2 + y^2 = \frac{1}{4},$$


which, hopefully, you recognize as an equation for a circle of radius 1/2, centered at $(1/2, 0)$.

You should convince yourself that, in polar coordinates, this circle is swept out as $\theta$ goes from 0 to $\pi$; if you went to $2\pi$, you'd sweep out the circle twice.

Of course, we know that the area inside a circle of radius 1/2 is $\pi(1/2)^2 = \pi/4$, but we can "check" this by calculating the integral

$$\int_0^{\pi} \frac{1}{2}\,r^2\,d\theta = \frac{1}{2}\int_0^{\pi} \cos^2\theta\,d\theta.$$


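As we saw in Proposition $\int \cos^2\theta\,d\theta = (\theta + \cos\theta\sin\theta)/2 + C$, and so, we conclude that

$$\text{area} = \left.\frac{\theta + \cos\theta\sin\theta}{4}\,\right|_0^{\pi} = \frac{\pi}{4}.$$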
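The remark about sweeping the circle twice can be seen numerically. In this small sketch (helper names are hypothetical), integrating $\frac{1}{2}\cos^2\theta$ over $[0, \pi]$ should give the area of the circle, while integrating over $[0, 2\pi]$ should give twice that, because every point of the circle is visited a second time.

```python
import math


def simpson(f, a, b, n=2000):
    """Composite Simpson's rule on [a, b] with an even number n of subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def half_r_squared(th):
    """Integrand (1/2) r^2 for the polar curve r = cos(theta)."""
    return 0.5 * math.cos(th) ** 2


once = simpson(half_r_squared, 0.0, math.pi)        # circle traced once
twice = simpson(half_r_squared, 0.0, 2 * math.pi)   # circle traced twice

print(once, math.pi / 4)
print(twice, math.pi / 2)
```

This is a reminder that the area-swept-out integral counts overlapping sweeps separately, so the range of $\theta$ must be chosen with care.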


3.4.1 Exercises 


In Exercises 1 through 5, you are given the component functions $x = x(t)$ and $y = y(t)$ for a parameterized curve. a) Find the area swept out by the parameterized curve over the given interval. b) Think of the parameterized curve as giving the position, at time $t$, of a particle in the xy-plane. Eliminate the parameter $t$, to arrive at an equation containing only $x$ and $y$; sketch the graph of this curve, and decide how the particle moves along the curve during the given times. Shade in the area swept out.


1. $x = t$, $y = t^2$; $0 \le t \le 2$.

E TT 
2. CSR US Oe SS 
3. c=, y=t?; -1<t<1. 


4.2 =e, y=e*; -1<t<1. 


5. $x = \ln t$, $y = t$; $1 \le t \le 3$.

6. Suppose that $a > b > 0$. Find the area swept out by the curve $p(t) = (t^a, t^b)$, for $t$ in the interval $[0, 1]$.


In Exercises 7 through 12, you are given a curve $r = f(\theta)$, described in polar coordinates. Sketch the curve and find the area it encloses.

7. $r = 1 + \sin\theta$.

8. $r = 1 + \sin(4\theta)$.

9. $r = 5 + 4\cos\theta$.

10. $r^2 = 9\sin(2\theta)$.

11. $r = 2(1 + \sin\theta)$.

12. $r = 3\cos(5\theta)$.

13. Find the area inside the big loop and outside the small loop of the graph of $r = \frac{1}{2} + \sin\theta$; this figure is known as a limaçon. See Figure

Figure 3.25: The limaçon given by $r = 0.5 + \sin\theta$.

14. Find the area swept out by the spiral $x = (2 - t)\cos t$, $y = (2 - t)\sin t$, $0 \le t \le 2$. See Figure 3.26.

Figure 3.26: The spiral $x = (2 - t)\cos t$, $y = (2 - t)\sin t$, $0 \le t \le 2$.

15. Find the area swept out by the exponential spiral $x = e^{-t}\cos(10t)$, $y = e^{-t}\sin(10t)$, $0 \le t < \infty$. See Figure 3.27.
Figure 3.27: An exponential spiral.

16. The most famous example of area swept out appears in Kepler's 2nd Law of Planetary Motion.

When a planet moves around the sun, Kepler determined that the planet moves in an ellipse, contained in a plane (which we take as the xy-plane), with the sun at one of the foci. Placing the sun at the origin in the xy-plane (heliocentric coordinates), a planet moves along an ellipse given in polar coordinates by

$$r = \frac{p}{1 + \epsilon\cos\theta},$$

where $p > 0$ and $0 < \epsilon < 1$. The constant $\epsilon$ is the eccentricity of the ellipse.

Figure 3.28: A planet, moving in an ellipse around the sun.

Applying Newton's Law of Universal Gravitation and vector Calculus, it is not terribly difficult to show that, as functions of time, the position of a planet given by $\theta = \theta(t)$ and

$$r(t) = \frac{p}{1 + \epsilon\cos(\theta(t))},$$


satisfies the differential equation 


a. Show that F ah 
ape | ese) 
dt dt 
b. Derive Kepler's 2nd Law of Planetary Motion: a planet sweeps an equal area in an equal amount of time, i.e., given an amount of time $T > 0$, the amount of area that the planet sweeps out between times $t$ and $t + T$ is independent of $t$. (Hint: Do not explicitly use that $r = p/(1 + \epsilon\cos\theta)$.)
3.5 Volume 


In this section, we will discuss how integration allows us to calculate the volume of solid regions 


for which we’re given the area of cross sections which are perpendicular to some axis. We will 
look at many examples in which the solids are solids of revolution, solids obtained by revolving a 
plane region around an axis and looking at the solid region which is “swept out”. In the context 
of solids of revolution, integrating the cross-sectional area is referred to as the disk method or 


washer method, due to the shapes of the cross sections. 


We will also look at a second method for finding the volumes of solids of revolution; this 
method has some aspects that are similar to the disk and washer methods, but it does not use 
planar cross sections of the solid. This second method for finding volumes of solids of revolution 


is the cylindrical shell method. 


Suppose that we have a solid region, $S$, in space, and that, after fixing a coordinate axis, say the x-axis, we know the area of every cross-sectional slice of $S$ which is perpendicular to the x-axis, and that the solid region lies between the x cross sections where $x = a$ and $x = b$, where $a < b$.


Figure 3.29: A typical x cross section of a solid. 


In order to determine the volume of $S$, given the cross-sectional area function $A(x)$, we take partitions of the interval $[a,b]$, and look at Riemann sums of estimates of the volume over the $i$-th subinterval, of width $\Delta x_i$, using a sample point $s_i$ in the subinterval. Our estimate for the volume over the $i$-th subinterval is the area $A(s_i)$ times a small thickness $\Delta x_i$. Thus, we obtain Riemann sums of the form $\sum_i A(s_i)\,\Delta x_i$.


Figure 3.30: A typical summand in a Riemann sum for volume. 


Taking the limit, we obtain: 


Proposition 3.5.1. If a solid region $S$ lies between the $x = a$ and $x = b$ cross sections, and the cross-sectional area function $A : [a,b] \to \mathbb{R}$ of the solid is continuous, then the volume of the solid region $S$ is

$$\int_a^b A(x)\,dx.$$


Remark 3.5.2. Of course, we generally think of the situation infinitesimally. At each x-coordinate between $a$ and $b$, the infinitesimal volume $dV$ of the solid $S$ above the infinitesimal interval from $x$ to $x + dx$, of infinitesimal width $dx$, is given by $dV = A(x)\,dx$, and the total volume is the continuous sum of the infinitesimal chunks of volume $dV$ as $x$ goes from $a$ to $b$, i.e.,

$$\text{volume of } S = \int_{x=a}^{x=b} dV = \int_a^b A(x)\,dx.$$

Figure 3.31: An infinitesimal piece of volume in a solid.


Example 3.5.3. Suppose that we have a rectangle of area $B$ (for "base") which lies in the plane at a fixed positive z-coordinate $H$ (for "height"). If we connect all of the points of the rectangle to the origin via straight lines, then the set of the points on all the line segments form a solid, an upside-down pyramid. We would like to determine the volume of this pyramid in terms of $B$ and $H$.

If we had a formula for the z cross-sectional area, $A(z)$, for $0 \le z \le H$, then we would simply calculate

$$\int_0^H A(z)\,dz,$$

but how do we obtain a formula for $A(z)$?

Call the length of each of one pair of parallel sides of the base rectangle $L$ and let $W$ be the length of each of the other sides. Each z cross section of the pyramid is a rectangle, of a length and width that vary as $z$ varies. Let $l = l(z)$ and $w = w(z)$ denote the length and width of the cross-sectional rectangle at $z$, with $l$ and $w$ chosen to correspond to the "same" sides as $L$ and $W$, respectively.

Figure 3.32: An upside-down slanted pyramid (the base has area $B$).


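By using similar triangles in the faces of the pyramid, we find that the length $l(z)$ and width $w(z)$ satisfy:

$$\frac{l}{L} = \frac{w}{W} = \frac{z}{H}.$$

Therefore, the cross-sectional area $A(z)$ is given by

$$A(z) = lw = \frac{LW}{H^2}\,z^2 = \frac{B}{H^2}\,z^2.$$

Now it's easy to find the volume. We calculate

$$\text{volume} = \int_0^H A(z)\,dz = \int_0^H \frac{B}{H^2}\,z^2\,dz = \frac{B}{H^2}\cdot\frac{H^3}{3} = \frac{BH}{3}.$$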
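The Riemann-sum description of volume can be tried out directly on this pyramid. The sketch below is only illustrative: `riemann_volume` is a hypothetical helper that uses midpoints as the sample points $s_i$ on equal subintervals, and the values $B = 6$ and $H = 5$ are made up for the demonstration.

```python
def riemann_volume(area, a, b, n):
    """Riemann sum of area(s_i) * dx over n equal subintervals, with midpoint samples."""
    dx = (b - a) / n
    return sum(area(a + (i + 0.5) * dx) * dx for i in range(n))


B = 6.0  # hypothetical base area
H = 5.0  # hypothetical height


def pyramid_cross_section(z):
    """Cross-sectional area A(z) = (B / H^2) z^2 of the upside-down pyramid."""
    return B * (z / H) ** 2


estimates = {n: riemann_volume(pyramid_cross_section, 0.0, H, n) for n in (10, 100, 1000)}
for n, v in estimates.items():
    print(n, v)          # estimates should approach B*H/3 as n grows

exact = B * H / 3        # the formula derived above
print("exact:", exact)
```

As the number of slices grows, the sums should settle toward $BH/3$, which is the limiting process behind the integral.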


The formula that we just derived, that the volume was one third of the area of the base 
times the height, may seem familiar to you; it’s the formula for the volume of a cone. You may 


be thinking to yourself “wait, we didn’t have a cone; we had some upside-down slanty pyramid 
thing”. 


It’s true that, when most people picture a cone, they picture a right circular cone, i.e., a 


disk (a filled-in circle) connected by line segments to a vertex which lies on the line through the
center of the circle, perpendicular to the plane containing the circle. However, the object that
we looked at in Example is also a type of cone; it, and right circular cones, are particular 


cases of the following general definition. 


Definition 3.5.4. Suppose that we have a region $R$, contained in a plane $P$ in $\mathbb{R}^3$, and have a point $v$ in $\mathbb{R}^3$, but not in the plane $P$.

Then, the set of points on all line segments connecting points of $R$ to the point $v$ is called the cone, with base $R$ and vertex $v$.

If $R$ contains at least 3 non-collinear points, so that there is only one plane $P$ containing $R$, then the (perpendicular) distance from the vertex $v$ to the plane $P$ is called the height of the cone.


Figure 3.33: A cone whose base is a general plane region. 


Thus, a right circular cone is a (general) cone, and so is the upside-down slanty pyramid thing from Example 3.5.3. However, the really cool thing about Example 3.5.3 is that it gives us a formula for the volume of any cone, at least, any cone that has a base which has some area. Why? Because to approximate the area of a non-rectangular base, you partition the region into little rectangles (with some rectangles possibly sticking out of the region, or not covering part of the region near the boundary), and add together all of the areas of the little rectangles; then you perform a limiting operation, as the rectangles get arbitrarily small.

But, each partition of the plane region into little rectangles partitions the cone into a collection of narrow cones with rectangular bases, i.e., narrow cones whose volume we calculated in Example 3.5.3.

Thus, we find

Proposition 3.5.5. The volume $V$ of a general cone, of height $H$, whose base has area $B$, is

$$V = \frac{1}{3}\,BH.$$


Now let’s turn to another classic geometry formula: the formula for the volume inside a 
sphere. 


Example 3.5.6. We wish to derive the well-known formula for the volume $V$ inside a sphere of radius $R$:

$$V = \frac{4}{3}\pi R^3.$$


Figure 3.34: A hemisphere of radius R. 


For convenience, we will use the sphere of radius $R$ which is centered at the origin. This sphere is the set of points where $x^2 + y^2 + z^2 = R^2$. We will determine the volume inside the top hemisphere, where $z = \sqrt{R^2 - x^2 - y^2}$, and then double the result.

Each z cross section, for $0 \le z \le R$, of the hemisphere yields a circle of some radius $r(z)$, and so the cross-sectional area of the solid is $A(z) = \pi(r(z))^2$. We need to find the function $r(z)$, or $(r(z))^2$, calculate the integral $\int_0^R \pi(r(z))^2\,dz$, and then double the result.

To find a formula for $(r(z))^2$, you need to sketch a picture, draw the "correct" auxiliary line segment, and use the Pythagorean Theorem. You should quickly find that $z^2 + (r(z))^2 = R^2$, and we therefore obtain the very simple formula $(r(z))^2 = R^2 - z^2$. Thus, $A(z) = \pi(R^2 - z^2)$ and

$$\int_0^R \pi(R^2 - z^2)\,dz = \pi\left[R^2 z - \frac{z^3}{3}\right]_0^R = \frac{2}{3}\pi R^3.$$

We conclude what we wanted, namely that $V = \frac{4}{3}\pi R^3$.
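As a quick numerical check of the sphere formula, the following sketch integrates the cross-sectional area $A(z) = \pi(R^2 - z^2)$ over $0 \le z \le R$ and doubles the result. The helper name `simpson` and the radius $R = 2$ are assumptions made for the illustration.

```python
import math


def simpson(f, a, b, n=200):
    """Composite Simpson's rule on [a, b] with an even number n of subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


R = 2.0  # hypothetical radius


def disk_area(z):
    """Area of the z cross section of the solid hemisphere: pi * (R^2 - z^2)."""
    return math.pi * (R**2 - z**2)


hemisphere = simpson(disk_area, 0.0, R)
sphere = 2 * hemisphere

print(sphere, 4 / 3 * math.pi * R**3)  # the two values should nearly agree
```

Because the integrand is a quadratic polynomial in $z$, Simpson's rule is exact here up to floating-point rounding.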


Solids of revolution: 


The sphere, or hemisphere, that we looked at, above, is a particular example of a solid 
of revolution; the hemisphere is “swept out” by revolving, around the z-axis, a quarter of a 
disk, specifically, the quarter disk, in the 1st-quadrant of the yz-plane, that’s inside the circle 


of radius R, centered at the origin. 


Figure 3.35: The solid hemisphere as a solid of revolution. 


We can revolve other plane regions about other axes. We assume that our region $B$ and our axis of revolution $\ell$ are in the same plane, and that $B$ does not contain points on each side of $\ell$, i.e., $B$ is completely on one side of $\ell$ or the other, possibly having an edge along $\ell$. We also assume that each cross section of $B$ itself by a line (in the plane) perpendicular to $\ell$ is a line segment, a point, or is empty.

With these assumptions, when we revolve $B$ in space, about $\ell$, each of the cross-sectional line segments will sweep out either a disk (a filled-in circle), if one end of the line segment is on $\ell$, or will sweep out what's known as a washer, a big disk minus a smaller disk with the same center; you get a washer when the revolved cross-sectional line segment in $B$ does not have an endpoint on $\ell$. (The technical term for a washer is an annulus.)

For instance, we saw a typical swept out disk in Figure 3.34 when we were calculating the volume of the hemisphere in Example An example that yields washers is given by revolving the bounded region between the graphs of $y = x^2$ and $y = x$ around the y-axis (we return to this example in Example 3.5.10); see Figure 3.36.


Figure 3.36: An example in which the cross sections are washers. 


The area of a disk is, of course, $\pi R^2$, where $R$ is the radius of the disk. Thus, if our axis of revolution is the x-axis, then the x cross-sectional area $A(x)$ of our solid of revolution will be

$$A(x) = \pi(R(x))^2 = \pi R^2,$$

where $R = R(x)$ is the radius of the disk, i.e., the length of the x cross-sectional line segment in the plane region $B$.

If the cross sections are washers, then there's the outside radius $R = R(x)$ of the washer, and the inside radius $r = r(x)$, the radius of the "missing" disk. In this case, we have that the x cross-sectional area $A(x)$ of our solid of revolution will be

$$A(x) = \pi(R(x))^2 - \pi(r(x))^2 = \pi(R^2 - r^2).$$

In terms of the x cross-sectional line segment in the plane region $B$, $r(x)$ is the distance from the x-axis to the closest end of the line segment, and $R(x)$ is the distance from the x-axis to the farthest end of the line segment.

It is a common, but terrible, mistake to write that the area of a washer is $\pi(R - r)^2$, instead of $\pi(R^2 - r^2)$. Just keep in mind that it's the difference between the areas inside two circles, and you should have no trouble getting it right.


Example 3.5.7. Consider the region $B$ in the xy-plane which is below the graph of $y = x^2$, and above the interval $[0,2]$ on the x-axis. Revolve the region $B$ around the x-axis. Find the volume of the resulting solid of revolution $S$.

Figure 3.37: The cross sections perpendicular to the x-axis are disks.

Solution:

The cross section of $S$ at any $x$ in the interval $[0,2]$ is a disk, whose radius is the y-coordinate on the curve $y = x^2$, i.e., $x^2$. So, the radius function is $R = R(x) = x^2$, the cross-sectional area is

$$A(x) = \pi R^2 = \pi(x^2)^2 = \pi x^4,$$

and an infinitesimal piece of volume is

$$dV = A(x)\,dx = \pi x^4\,dx.$$

Therefore, we find that the volume is

$$V = \int_0^2 \pi x^4\,dx = \left.\frac{\pi x^5}{5}\right|_0^2 = \frac{32\pi}{5}.$$

Example 3.5.8. Consider the bounded region $B$ in the xy-plane which is "trapped" between the graphs of $y = x^2$ and $y = x$, i.e., the region between $x = 0$ and $x = 1$ that's above the graph of $y = x^2$ and below the graph of $y = x$. Revolve the region $B$ around the x-axis. Find the volume of the resulting solid of revolution $S$.

Figure 3.38: The cross sections are now washers.

Solution:

Unlike the plane region in Example 3.5.7, the current region $B$ does not abut the x-axis; there is a gap between $B$ and the axis of revolution. Thus, our cross sections of $S$, perpendicular to the x-axis, are not disks, but are, instead, washers.

The inside radius, the radius of the hole in the washer, $r(x)$, at the x cross section is the distance from the axis to the closest part of $B$, i.e., to the corresponding point on the curve given by $y = x^2$. Therefore, $r = r(x) = x^2$.

The outside radius, the big radius of the entire washer, $R(x)$, at the x cross section is the distance from the axis to the farthest part of $B$, i.e., to the corresponding point on the curve given by $y = x$. Therefore, $R = R(x) = x$.

Hence, our cross-sectional area is

$$A(x) = \pi(R^2 - r^2) = \pi(x^2 - x^4),$$

an infinitesimal piece of volume is $dV = \pi(x^2 - x^4)\,dx$, and the volume of the solid of revolution is

$$V = \int_0^1 \pi(x^2 - x^4)\,dx = \pi\left[\frac{x^3}{3} - \frac{x^5}{5}\right]_0^1 = \frac{2\pi}{15}.$$


Example 3.5.9. What if we consider the same region as in Example 3.5.8, but revolve the region about the line given by $y = 2$? Then, what volume do we obtain for the solid of revolution $S$?

Solution: The line $\ell$ given by $y = 2$ is parallel to the x-axis, so that cross sections of the region $B$, perpendicular to $\ell$, are the same as the cross sections perpendicular to the x-axis, and $x$ still goes from 0 to 1. We obtain washers as the cross sections of the solid $S$, with an inside radius $r = r(x)$ and an outside radius $R = R(x)$.

It takes some thought to come up with the functions $r(x)$ and $R(x)$. Certainly, looking at a diagram helps.

The inside radius $r(x)$ is the distance from $\ell$ to the closest curve; that closest curve is given by $y = x$. The question is: at a fixed x-coordinate, what is the distance from $y = 2$ to $y = x$?

If you think about it, and look at the diagram, hopefully you come up with the answer. The distance $r(x)$ is the distance from $y = 2$ to the x-axis, minus the distance from the x-axis to the corresponding point on $y = x$. The distance from $y = 2$ to the x-axis is obviously 2. The distance from the x-axis to the corresponding point on the graph of $y = x$ is simply the y-coordinate on the graph, namely $x$. Therefore, $r = r(x) = 2 - x$.

Figure 3.39: The cross sections are washers, but our axis of revolution has changed.

Similarly, the outer radius of the washers, the distance $R(x)$, is the distance from $y = 2$ to the x-axis, minus the distance from the x-axis to the corresponding point on $y = x^2$. We find $R = R(x) = 2 - x^2$.

Hence, our cross-sectional area is

$$A(x) = \pi(R^2 - r^2) = \pi\big((2 - x^2)^2 - (2 - x)^2\big) = \pi(4x - 5x^2 + x^4),$$

an infinitesimal piece of volume is

$$dV = \pi(4x - 5x^2 + x^4)\,dx,$$

and the volume of the solid of revolution is

$$V = \int_{x=0}^{x=1} dV = \int_0^1 \pi(4x - 5x^2 + x^4)\,dx = \pi\left[2x^2 - \frac{5x^3}{3} + \frac{x^5}{5}\right]_0^1 = \frac{8\pi}{15}.$$
x=0 0 
Example 3.5.10. What if we consider the same region as in Example 3.5.8, but revolve the region about the y-axis? Then, what volume do we obtain for the solid of revolution $S$?

Figure 3.40: Washers around the y-axis.

Solution: Now, cross sections perpendicular to our axis of revolution are y cross sections, which are still washers. However, as we are using y cross sections, we need to write everything in terms of $y$.

The region $B$ lies between the lines $y = 0$ and $y = 1$ (it is a special property of the region $B$ that makes the y "limits" the same as the x limits; these would not be the same, in general).

The inside radius $r(y)$ of the cross-sectional washer is the x-coordinate on the closest curve to the axis of revolution, i.e., the x-coordinate on the graph of $y = x$. Thus, $r = r(y) = y$. The outside radius $R(y)$ is the x-coordinate on the farthest curve, i.e., on the graph of $y = x^2$. In terms of $y$, this x-coordinate is $\sqrt{y}$. Therefore, $R = R(y) = \sqrt{y}$.

Hence, our cross-sectional area is

$$A(y) = \pi(R^2 - r^2) = \pi(y - y^2),$$

an infinitesimal piece of volume is $dV = \pi(y - y^2)\,dy$, and the volume of the solid of revolution is

$$V = \int_{y=0}^{y=1} dV = \int_0^1 \pi(y - y^2)\,dy = \pi\left[\frac{y^2}{2} - \frac{y^3}{3}\right]_0^1 = \frac{\pi}{6}.$$
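The three washer computations for the region between $y = x^2$ and $y = x$ differ only in the inner and outer radius functions, which the following sketch makes explicit. The names `simpson` and `washer_volume` are hypothetical; the radii are the ones identified in Examples 3.5.8 through 3.5.10, and the comparison values $2\pi/15$, $8\pi/15$, and $\pi/6$ come from integrating $\pi(R^2 - r^2)$ by hand.

```python
import math


def simpson(f, a, b, n=1000):
    """Composite Simpson's rule on [a, b] with an even number n of subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def washer_volume(outer, inner, a, b):
    """Integrate the washer area pi * (R^2 - r^2) along the axis from a to b."""
    return simpson(lambda s: math.pi * (outer(s) ** 2 - inner(s) ** 2), a, b)


# Revolving about the x-axis: R = x, r = x^2.
about_x_axis = washer_volume(lambda x: x, lambda x: x**2, 0.0, 1.0)

# Revolving about the line y = 2: R = 2 - x^2, r = 2 - x.
about_y_equals_2 = washer_volume(lambda x: 2 - x**2, lambda x: 2 - x, 0.0, 1.0)

# Revolving about the y-axis: R = sqrt(y), r = y.
about_y_axis = washer_volume(lambda y: math.sqrt(y), lambda y: y, 0.0, 1.0)

print(about_x_axis, 2 * math.pi / 15)
print(about_y_equals_2, 8 * math.pi / 15)
print(about_y_axis, math.pi / 6)
```

Note that the same plane region produces three different volumes; only the axis of revolution, and hence the radii, changed.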


For solids of revolution, there's a minor change in how we want to think about obtaining the infinitesimal pieces of volume that we get by taking cross-sectional area and multiplying by an infinitesimal thickness.

Suppose that we let our axis of revolution be the x-axis, that we have a region $B$ in the xy-plane, and that we generate a solid of revolution by revolving $B$ in space around the x-axis.

Then, what we've been doing is taking cross sections of our solid $S$, perpendicular to the x-axis, and producing an infinitesimal chunk of volume $dV = A(x)\,dx$, where $A(x)$ is the area of the x cross section of $S$. Instead of revolving and then taking cross sections, we've seen that we can first take cross-sectional line segments in the plane region $B$, perpendicular to the x-axis, and revolve those around the x-axis to obtain disks or washers with area $A(x)$. We then multiply by an infinitesimal thickness $dx$.

The slight change in thinking that we wish to bring up now is that we could have first multiplied the cross-sectional line segment in $B$ by $dx$, to obtain a rectangle of infinitesimal width, and then revolved this rectangle about the x-axis to generate our thickened disks or washers.

Why look at things in terms of revolving infinitesimally thin rectangles, perpendicular to the axis of revolution? Because it leads us to consider calculating infinitesimal volumes, and so total volumes, in a different way.


Cylindrical Shells 


We now wish to take cross-sectional line segments in $B$ which, instead of being perpendicular to the x-axis, are parallel to the x-axis. This means that we look at cross sections perpendicular to the y-axis. For each y-coordinate for which we have points in the region $B$, let $h(y)$ be the length of the cross-sectional line segment in $B$ at the given y-coordinate. We thicken these line segments to obtain infinitesimally wide/thick rectangles by multiplying by an infinitesimal thickness, which is now $dy$ (not $dx$). We then revolve these rectangles around the x-axis, generating infinitesimally thin (right circular) cylinders (think of aluminum cans, with no tops or bottoms); these are called cylindrical shells.

Figure 3.41: A typical cylindrical shell.

The surface area of a cylinder (still with no top or bottom) is the circumference times the height, so $A(y)$ equals $2\pi r h$, where $r$ is the radius of the cylinder and $h$ is its height. Since we're rotating around the x-axis, the radius, in terms of $y$, is simply $y$, while the height $h$ is what we wrote above for the length of the cross-sectional line segment, namely $h(y)$. We multiply this area by the infinitesimal thickness $dy$ to obtain the infinitesimal piece of volume

$$dV = 2\pi y\,h(y)\,dy.$$

Therefore, if the region $B$ lies between $y = a$ and $y = b$, where $a < b$, and we generate a solid of revolution by revolving $B$ around the x-axis, we find, using cylindrical shells, that the volume is

$$V = \int_{y=a}^{y=b} 2\pi r h\,dy = \int_a^b 2\pi y\,h(y)\,dy.$$

Of course, if you revolve a region around a different axis, then the formula above needs to be changed accordingly.


Example 3.5.11. Let’s return to the solid of revolution from Example 3.5.7. We had the region 
B in the xy-plane which is below the graph of $y = x^2$, and above the interval [0,2] on the x-axis. 
We revolved the region B around the x-axis, and found the volume V of the resulting solid of 
revolution S by using disks. We calculated that $V = 32\pi/5$. 


Figure 3.42: Volume by cylindrical shells. 


We will now redo this problem, using cylindrical shells. As you will see, the integral that 
we obtain will not look very much like the disk-method integral; nonetheless, we’ll get the same 
answer. We’d better! 


As we discussed above, when using shells, we see that the y-coordinates of the region B are 
between 0 and 4. For each y-coordinate between 0 and 4, we find that the length h(y) of the 
cross-sectional line segment is 2 minus the distance from the y-axis to the curve $y = x^2$; this 
distance is the x-coordinate on the curve, which is $\sqrt{y}$. Thus, $h(y) = 2 - \sqrt{y}$, and the volume 
of the solid of revolution is 


$$V = \int_{y=0}^{y=4} dV = \int_0^4 2\pi r h\, dy = \int_0^4 2\pi y\,(2 - y^{1/2})\, dy = 2\pi \int_0^4 (2y - y^{3/2})\, dy = 2\pi \left( y^2 - \tfrac{2}{5} y^{5/2} \right) \Big|_0^4 = \frac{32\pi}{5}.$$


Math works again!!! 
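As a quick numerical sanity check of the two computations in this example, here is a minimal Python sketch. The helper `midpoint_integral` and the variable names are our own illustrative choices (they are not part of the text), and the midpoint rule only approximates the integrals.

```python
import math

def midpoint_integral(f, a, b, n=20_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    width = (b - a) / n
    return width * sum(f(a + (i + 0.5) * width) for i in range(n))

# Disk method: cross sections perpendicular to the x-axis, radius x^2.
disk_volume = midpoint_integral(lambda x: math.pi * (x**2) ** 2, 0.0, 2.0)

# Shell method: radius y and height h(y) = 2 - sqrt(y), for 0 <= y <= 4.
shell_volume = midpoint_integral(
    lambda y: 2 * math.pi * y * (2 - math.sqrt(y)), 0.0, 4.0
)

exact_volume = 32 * math.pi / 5
print(disk_volume, shell_volume, exact_volume)
```

Both approximations should agree with $32\pi/5$ to several decimal places, even though the two integrands look quite different.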


Example 3.5.12. You may wonder why you would ever prefer using cylindrical shells over 
using disks or washers to find the volume of a solid of revolution. 
Consider the curved triangular region B in the xy-plane, which has part of the y-axis as its 
left edge, its top edge is given by y = 2 - x, and its bottom curved edge is given by $y = \sqrt{x}$. 
This region lies between x = 0 and x = 1, and between y = 0 and y = 2. 


Let’s revolve B around the y-axis and calculate the volume of S, the resulting solid of 
revolution, both by using disks and by using cylindrical shells. 


By disks: 


If we use disks, then we take cross sections of B that are perpendicular to the axis of 
revolution; here, that is the y-axis. You may immediately see the issue that arises; the formula for the 
radius of the disks changes as the y-coordinate passes through y = 1, the y-coordinate where 
the graphs of y = 2 - x and $y = \sqrt{x}$ intersect. 


Figure 3.43: Volume by disks. 


For 0 ≤ y ≤ 1, the radius of each cross-sectional disk is determined solely by the graph of 
$y = \sqrt{x}$ (and the y-axis). The radius, r = r(y), is the x-coordinate of the corresponding point 
on the graph of $y = \sqrt{x}$; thus, $r(y) = y^2$ for 0 ≤ y ≤ 1. 


However, for 1 ≤ y ≤ 2, the radius of each cross-sectional disk is determined solely by the 
graph of y = 2 - x. Again, the radius, r = r(y), is the x-coordinate of the corresponding point 
on the graph, but now it’s the graph of y = 2 - x; solving for x in terms of y, we find that 
r(y) = 2 - y for 1 ≤ y ≤ 2. 


How do we deal with this changing formula for the radius of the disks? By splitting the 
integral. 


$$V = \int_0^1 \pi\,(y^2)^2\, dy + \int_1^2 \pi\,(2-y)^2\, dy = \frac{\pi}{5} + \frac{\pi}{3} = \frac{8\pi}{15}.$$


By cylindrical shells: 


However, even though the integrals that appear when using the disk method are easy, many 
people find it aesthetically unpleasing to have to split the integral up into two pieces; if you use 
cylindrical shells, there is no need to split things up. 


When using cylindrical shells, we take cross sections parallel to the y-axis, i.e., perpendicular 
to the x-axis, for 0 ≤ x ≤ 1. For a given x-coordinate between 0 and 1, the height h(x) of the 
cylindrical shell, which is the length of the x cross-sectional line segment in B, is the difference 
between the corresponding y-coordinates on the two graphs. Thus, $h(x) = (2 - x) - \sqrt{x}$. The 
radius r(x) of the cylindrical shell is the distance from the x cross-sectional line segment in B 
to the y-axis; this distance is simply x. 


Figure 3.44: Volume by cylindrical shells. 


Therefore, the infinitesimal volume of each cylindrical shell is given by 


$$dV = 2\pi r h\, dx = 2\pi x\,(2 - x - x^{1/2})\, dx,$$


and so the volume is 


$$V = \int_{x=0}^{x=1} dV = \int_0^1 2\pi x\,(2 - x - x^{1/2})\, dx = \int_0^1 2\pi\,(2x - x^2 - x^{3/2})\, dx = 2\pi \left( 1 - \frac{1}{3} - \frac{2}{5} \right) = \frac{8\pi}{15}.$$


A final note on this example: 


You should convince yourself that, had we revolved the region B around the x-axis, instead 
of around the y-axis, then using washers would not require us to split our integral into two 
pieces, but using cylindrical shells would have. 
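To see the split disk integral and the single shell integral agree numerically, here is a small Python sketch. The function and variable names are hypothetical, chosen only for this illustration, and the midpoint rule is an approximation rather than an exact evaluation.

```python
import math

def midpoint_integral(f, a, b, n=20_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    width = (b - a) / n
    return width * sum(f(a + (i + 0.5) * width) for i in range(n))

# Disks about the y-axis: the radius formula changes at y = 1,
# so the integral is split into two pieces.
lower_piece = midpoint_integral(lambda y: math.pi * (y**2) ** 2, 0.0, 1.0)
upper_piece = midpoint_integral(lambda y: math.pi * (2 - y) ** 2, 1.0, 2.0)
disk_volume = lower_piece + upper_piece

# Shells about the y-axis: radius x, height (2 - x) - sqrt(x), one integral.
shell_volume = midpoint_integral(
    lambda x: 2 * math.pi * x * ((2 - x) - math.sqrt(x)), 0.0, 1.0
)

exact_volume = 8 * math.pi / 15
print(disk_volume, shell_volume, exact_volume)
```

The two pieces of the disk integral are approximately $\pi/5$ and $\pi/3$, and their sum matches the shell integral.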


Example 3.5.13. In the previous example, we saw that using cylindrical shells might sometimes 
be easier for calculating volumes of solids of revolution. However, which method to use is not 


always a question of mild convenience or aesthetics. 


Consider the region B under the graph of $y = f(x) = x - x^5$ and above the interval [0, 1] on 
the x-axis; see Figure 3.45. 


Figure 3.45: The region B under $y = x - x^5$. 


If we revolve B about the x-axis, the volume V of the resulting solid of revolution is easy to 
calculate via disks; we obtain 


$$V = \int_0^1 \pi\,(x - x^5)^2\, dx = \pi \left( \frac{x^3}{3} - \frac{2x^7}{7} + \frac{x^{11}}{11} \right) \Big|_0^1 = \frac{32\pi}{231}.$$


However, attempting to calculate the volume V using cylindrical shells leads to insurmountable 
difficulties. We can set f'(x) equal to 0 and find that the maximum y value for 0 ≤ x ≤ 1 
is $4/5^{5/4} \approx 0.53499$, but we have a bigger problem. Looking at Figure 3.45, you can see that, 
for each y value between 0 and $4/5^{5/4}$, there are two corresponding x values on the graph of 
$y = x - x^5$; call the smaller one $x_1(y)$ and the larger one $x_2(y)$. Using cylindrical shells, we 
would obtain 


$$V = \int_0^{4/5^{5/4}} 2\pi y\,\bigl(x_2(y) - x_1(y)\bigr)\, dy,$$


but we need formulas for $x_1$ and $x_2$ in terms of y. This means we need to solve $y = x - x^5$ for 
x; however, there is no “nice” formula for the solutions to general quintic equations, and so we 
are completely stuck. 


You may be thinking “ah - while the shell method sometimes gives nicer solutions, it can 
run into insurmountable problems that the disk/washer method does not have”. This is not 
the case; you should revolve the region B about the y-axis, instead of about the x-axis, and 
convince yourself that using cylindrical shells is now the manageable method, while, this time, 
using washers leads to the problem of solving $y = x - x^5$ for x. 


The moral of the story should be an obvious, general principle: when you have multiple 
techniques for obtaining solutions, and one of them doesn’t work, try another one! 
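Although no closed formula for $x_1(y)$ and $x_2(y)$ is available, the shell integral in this example can still be approximated numerically. The sketch below is only an illustration under our own assumptions: it finds the two solutions of $y = x - x^5$ by bisection (the helper names `bisect`, `x1`, and `x2` are hypothetical) and compares the shell approximation with the disk value $32\pi/231$.

```python
import math

def f(x):
    return x - x**5

def bisect(g, lo, hi, iters=60):
    """Approximate a root of g on [lo, hi], assuming g(lo) <= 0 <= g(hi)."""
    for _ in range(iters):
        mid = (lo + hi) / 2
        if g(mid) <= 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2

def midpoint_integral(func, a, b, n=2000):
    """Approximate the integral of func over [a, b] with the midpoint rule."""
    width = (b - a) / n
    return width * sum(func(a + (i + 0.5) * width) for i in range(n))

x_peak = 5 ** (-0.25)   # solves f'(x) = 1 - 5x^4 = 0
y_max = f(x_peak)       # equals 4 / 5^(5/4)

def x1(y):
    # Smaller solution: f is increasing on [0, x_peak].
    return bisect(lambda x: f(x) - y, 0.0, x_peak)

def x2(y):
    # Larger solution: f is decreasing on [x_peak, 1].
    return bisect(lambda x: y - f(x), x_peak, 1.0)

shell_volume = midpoint_integral(
    lambda y: 2 * math.pi * y * (x2(y) - x1(y)), 0.0, y_max
)
disk_volume = 32 * math.pi / 231
print(y_max, shell_volume, disk_volume)
```

This does not contradict the discussion above: the numerical route gives decimal approximations only, while the disk method gives the exact value.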


Example 3.5.14. (Gabriel’s horn) In the final example of this section, we will calculate the 
volume of a solid of revolution, even though the solid “goes out to infinity”. 


Figure 3.46: The region B under y = 1/x and above [1, ∞). 
Consider the region B under the graph of y = 1/x and above the interval [1, ∞) on the 
x-axis. When you revolve B around the x-axis, the solid of revolution S that is generated looks 
like some sort of long, narrow trumpet, a “horn”, where the mouthpiece has been stretched out 
to infinity. Of course, the solid region should be filled in, but we’ve drawn just the surface in 
order to make it look more horn-like. 


Figure 3.47: Gabriel’s Horn. 


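We'll use disks to calculate the volume of S. Our disks are cross sections perpendicular to 
the x-axis; the radii are given by r(x) = 1/x, and so $dV = \pi (1/x)^2\, dx$. The total volume is 


$$V = \int_{x=1}^{x=\infty} dV = \int_1^\infty \pi x^{-2}\, dx = \lim_{b\to\infty} \int_1^b \pi x^{-2}\, dx = \pi \lim_{b\to\infty} \left( -\frac{1}{x} \right) \Big|_1^b = \pi \lim_{b\to\infty} \left( 1 - \frac{1}{b} \right) = \pi.$$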
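The limit in this improper integral can be watched numerically. In the following Python sketch (our own illustration; `partial_volume` and the chosen cutoffs are assumptions, not from the text), the volume of the horn truncated at x = b is approximated and compared with the closed form $\pi(1 - 1/b)$.

```python
import math

def midpoint_integral(f, a, b, n=200_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    width = (b - a) / n
    return width * sum(f(a + (i + 0.5) * width) for i in range(n))

def partial_volume(b):
    """Approximate volume of the horn between x = 1 and x = b, using disks."""
    return midpoint_integral(lambda x: math.pi / x**2, 1.0, b)

cutoffs = [10.0, 100.0, 1000.0]
partial_volumes = [partial_volume(b) for b in cutoffs]
closed_forms = [math.pi * (1 - 1 / b) for b in cutoffs]

for b, approx, exact in zip(cutoffs, partial_volumes, closed_forms):
    print(b, approx, exact)
```

The truncated volumes increase with b but stay below $\pi$, which is consistent with the limit computed above.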


3.5.1 Exercises 


Suppose that the solids in Exercises 1 - 4 have cross-sectional area A(x) for x in the 
given range. Calculate the volume of the solids. 


1. $A(x) = 8x^2 + 1$, 2 ≤ x ≤ 4. 

2. A(x) = cos x, -π/2 ≤ x ≤ π/2. 

3. A(z) =2-29,0<2¢<1. 

4. A(x) = 2πx, 3 ≤ x ≤ 6. 

5. Suppose S is a general cone with a square base lying in the plane z = 10. Each side of the 
base has length 10 and the vertex of the cone is the point (0,0,2). What is the volume of 
the general cone? 

6. A frustum of a pyramid is constructed by “chopping off” the top of the pyramid. More 
specifically, let Ry and Ry be squares in the planes z = 0 and z = 10, respectively. R, has 


sides of lengths 12 and Rz has sides of lengths 4. Then the frustum S is the of points on 
all line segments connecting R; and Rz. What is the volume of S? 


7. Redo the previous problem where the side lengths of R, and Rp are a and J, respectively, 
and where R2 lies in the plane z = h. Assume R;, still lies in the plane z = 0. 


8. Suppose that a solid has its base in the xy-plane, and that each cross section, perpendicular 
to the x-axis, for 0 ≤ x ≤ 1, is a (filled-in) square, one of whose sides goes from the x-axis 
out to the curve $y = x^2$. See Figure 3.48. Find the volume of the solid. 


Figure 3.48: Figure 3.49: 


9. Suppose that a solid is formed in such a way that each cross section perpendicular to the 
x-axis, for 0 ≤ x ≤ 1, is a disk, a diameter of which goes from the x-axis out to the curve 
$y = \sqrt{x}$. See Figure 3.49. Find the volume of the solid. 


Determine the volume of the region obtained by revolving the region lying below 
the graph of the given function and above the x-axis about the specified axis. 


10. y = sin x, x-axis, x in [0, π]. 

ll. y=a2?2+2+1, y=10, a in (0,2). 

12. y=Inq, z-axis, x in [1, }]. 

13. y = 27, a-axis, x in [0,1]. 

14. y= a? +k, z-axis, x in [0, 1). 

15. y=", x-axis, x in [0,1] and n> 0. 

16. e+y=c,c> 0, a-axis, x in (0, c. ec 

17. y =sinz, about the line y = 2 for x in [0,7]. 

18. y=2+sinz, about the line y = 2 for z in [0, z]. 

19. Revolve the region in the first quadrant bounded by the lines f(a) = x, and g(x) = 5(a—3), 
about the z-axis. What is the volume of the solid of revolution? 

20. Revolve the region bounded by the graphs of h(x) = e~*, j(a) = 2 and the line x = 1 
about the z-axis. What is the volume of the resulting solid? 

21. Let a and b be the two solutions in [0, π) to the equation g(x) = k(x) where g(x) = sin x 
and k(x) = sin 2x. What is the volume of the solid obtained by revolving the region 
defined by the graphs of g and k and the lines x = a and x = b about the x-axis? 

22. Let h(x) = 2? + 3a — 1, k(x) = 72 — 4. What is the volume obtained by revolving the 


region bounded by the graphs of these two functions about 


a. the x-axis; 


b. the y-axis? 


Approximate the volume obtained by revolving the region lying below the graph of 


the 


function and the z-axis about the z-axis. Use the midpoint method of approx- 


imation along with the shell method of calculating volume. Partition the interval 


of integration into n = 4 evenly spaced subintervals. 


23 


24 


25 


26 


. f(x) = Vsing, «x in [0,7]. 
. h(v) = In(sin az) + 3, « in [1/4, 37/4]. 
. g(z) = tan e”, x in [—4,0]. 


. j(x) = cosh(1/z), x in [1, 5]. 
27. 


28. 


29. 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


k(x) = €*, x in [0,7]. 


Let u(z) = 2” and v(x) = 2” where m > n > 1. What is the volume obtained by 
revolving the ic) bounded by the graphs of these two functions for x in [0,1] about the 


z-axis? 


What is the volume obtained by revolving the region described in the previous problem 


about the y-axis? 


Suppose C is a right circular cone with its vertex at the origin, and with its circular base 
in the plane z = h. Assume the radius of the base is r. What is the average area of a cross 
section parallel to the zy plane? Express your answer in terms of the volume of the cone 
and its height. 


Suppose G is a general cone with vertex at the origin and with its base in the plane z = h. 


What is the average area of a cross section parallel to the xy plane? 
Suppose that f > 0 on the interval [0, |], that f is continuous, and that F' is an anti- 
derivative of f. Prove that the volume obtained by revolving the region below the graph 


b 
of f(a) and above the z-axis about the y axis is V = —on f F(x) daz. 
0 


_ {sina 


Let f(x) 
below the graph of f and above the z-axis about the x-axis. Does this region have finite 


for « > 1. Consider the solid obtained by revolving the region lying 


volume? Hint: compare the volume to another region with a known volume. 


2 2 
Recall that the equation - + P = 1 defines an ellipse. Let R be the portion of the ellipse 
a 


lying in the upper-half plane, y > 0. 


a. What is the volume obtained by revolving the region about the x-axis? 


b. Set a@ = b in your result from part (a) and verify that this is the correct volume 


formula for a sphere. 


Let f(#) = cosha and g(a) = «+1. Let a and b be the two x values where the graphs 
of the functions intersect. Calculate the volume obtained by revolving the area between 


these two curves on the interval [a,b] about the z-axis. 


Calculate the volume obtained by revolving the region in the previous problem about the 


y-axis. 


A torus in three dimensions can be obtained as follows. Let D be a disc in the xy plane 
with center (R,0) and with radius r where r < R. Rotate the disc about the y-axis to 


realize a torus. For the next few problems, we refer to r as the inner radius and R as the 


outer radius. Use the disc method to calculate the volume of this torus. 
38. Another way to construct a torus is to bend a cylinder until the top and bottom meet. 
The volume of the resultant torus is identical to that of the original cylinder. Use the 
setup in the previous problem and calculate the volume of the torus by calculating the 
volume of a cylinder. 

39. Calculate the volume obtained by revolving the region below the graph of y = sin x and above 
the x-axis on the interval x in [0, π] about the y-axis using: 

a. The shell method; 
b. The disk/washer method. 

40. Suppose a torus shaped inner-tube is floating in a swimming pool. The inner radius is 4 
inches, and the outer radius is 18 inches. A portion of the inner tube is submerged below 
the water level. If the water level is half an inch, what is the volume of the submerged 
portion of the inner tube? 

41. Suppose a spherical volleyball with radius 65 cm is floating in a pool so that the ball is 
submerged up to 5 cm. What is the volume of the submerged portion of the ball? 

42. Suppose an object with volume V is partially submerged in some fluid. Let $V_s$ be the 
volume of the submerged portion of the object. Suppose the density of the object is $\rho_o$ 
and the density of the fluid is $\rho_f$. A basic result of fluid statics states that 

$$\frac{V_s}{V} = \frac{\rho_o}{\rho_f}.$$

Calculate the densities of the inner tube and volleyball mentioned in the previous two 
problems if the density of water is 1 gm/cm^3. 

43. Suppose that the volleyball of radius 65 cm is again floating in water. The ratio of the 
volumes of the submerged portion of the ball to the unsubmerged portion is 1 : 4. To what 
depth is the volleyball submerged? 

44. Suppose we try to generalize Gabriel’s Horn. 

a. Let $f(x) = 1/x^n$ and consider the solid formed by revolving the region lying below 
the graph of f and above the x-axis about the x-axis on the interval x in [1, ∞). Does 
the volume exist when n > 1? If so, what is the volume? 

b. Does the volume exist when 0 < n < 1? If so, what is the volume? 

45. Calculate the volume of the solid obtained by revolving the region bounded by the graph 
of y = 1/x, the line x = 1 and the line x = a where 0 < a < 1, about the y-axis. Does this 
integral converge as a → 0? 
46. 


47. 


48. 


49. 


50. 


51. 


52. 


53. 


54. 


55. 


Let R be the upper half of the disc centered at the origin with radius r. What is the 
volume of the region obtained by revolving R about the line y = b where b > r or b < 0? 


Let f(x) = cx? and g(x) = c?\/x where c > 0. Let V. be the volume obtained by revolving 
the region bounded by the graphs of f and g about the z-axis. What is V,? 


Calculate the volume of the region defined by revolving the region lying below the graph 
of $y = \sin(x^2)$ and above the x-axis about the y-axis. Assume x in $[0, \sqrt{\pi}]$. 


Suppose an annulus with inner radius r and outer radius R is situated in the plane z = 7. 
The center of the annulus is the point (2,700, 7). What is the volume of the general cone 


with base the annulus and with a vertex at the origin? 


Calculate the volume, if it exists, of the solid obtained by revolving the region lying below 
the graph of y = sec x and above the x-axis about the x-axis on the interval [0, π/2]. 


Let g(x) = e* ge [0,co). Calculate the volume, if it exists, of the region obtained by 
revolving the region lying below the graph of g(x) and above the z-axis about the y-axis. 


Ho 


What is the volume obtained by revolving the region lying below the graph of y = e7 
and above the z-axis about the z-axis on the interval [0, 00)? 


Consider the function 


r<0- 


w= {0 ae 


a. What is the volume obtained by revolving the region lying below the graph of f(x) 


and above the z-axis about the z-axis on the interval (—co, 00)? 


b. Use the disc method to try and calculate the volume obtained by revolving the region 
below the graph of f(a) and above the x-axis around the y-axis for a < y < 1. Does 


this integral converge as a — 0? 


c. Try calculating the volume in part (b) using the shell method. Does your answer 
concur with that in part (b)? 


A right circular cone is obtained by revolving the region bounded by the curves y = mx, 
x = 0, and y = h, where m ≠ 0, about the y-axis. What is the volume of the cone? Leave 
your answer in terms of m and h. Make sure to consider the case that m is negative. 


Archimedes noted that if a right circular cone, a hemisphere, and a cylinder all have the 
same heights and radii, then the ratios of their volumes is 1 : 2 : 3 in the order specified 
above. Verify this claim. Observe that if we append a complete sphere to the tail of this 
sequence, the ratio pattern extends to 1:2:3:4. 
56. Suppose a regular n-gon (equilateral, equiangular and convex) is situated in the plane 
z = h with its center on the z-axis. Recall that the apothem is the distance from the 
center of the polygon to the midpoint of one of the sides. 

a. Prove that the area of the n-gon with apothem a is given by $B_n = a^2 n \tan(\pi/n)$. 

b. What is the volume, $V_n$, of the solid defined by the line segments connecting the 
origin to the polygon? 

c. Show that $B_n \to \pi a^2$ and $V_n \to \pi a^2 h/3$ as n → ∞. Geometrically, why does this make 
sense? 
3.6 Surface Area 


In this section, we will look at surfaces of revolution; these are the outer surfaces (or, parts of 
the surfaces) of solids of revolution, which we discussed in the previous section. We will use 
the definitions and results on arc lengths of curves from Section 3.3, and will briefly recall the 
needed material here. 


Suppose that C is a simple regular curve (Definition 3.3.9) in the xy-plane. We want to 
revolve C around some line ℓ, the axis of revolution, look at the surface of revolution that is 
swept out, and find its area, i.e., calculate the surface area of a surface of revolution. 


Consider an infinitesimal piece of arc length ds on C, which is a distance r from ℓ. Then, 
ds “looks” infinitesimally like the length of part of a straight line which is parallel to ℓ, and 
the infinitesimal area dA swept out by ds is that of a right, circular cylinder of radius r and 
height ds. (It is not trivial to show that the tilt and straightening of the arc length to make it 
“parallel” to the axis is negligible, compared to the “infinitesimal” arc length; it is nonetheless 
true.) Thus, 


$$dA = 2\pi r\, ds.$$


Figure 3.50: A negligible tilt of the arc length. Figure 3.51: Revolving around the axis. 


Recall from Proposition 3.3.12 that, for a simple regular parameterization p(t) = (x(t), y(t)), 


$$ds = \sqrt{\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2}\; dt.$$


Therefore, we have: 


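Proposition 3.6.1. If p(t) = (x(t), y(t)), for a ≤ t ≤ b, is a simple regular parameterization 
of C, and r(t) equals the distance from p(t) to ℓ, then the area of the surface of revolution 
generated by revolving C around ℓ is 


$$\text{surface area} = \int_{t=a}^{t=b} 2\pi r\, ds = \int_a^b 2\pi\, r(t) \sqrt{\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2}\; dt.$$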


As in Section 3.3, if we are given the curve C as part of the graph of a function y = f(x) or 
x = f(y), then ds can be written as 


$$\sqrt{1 + \left(\frac{dy}{dx}\right)^2}\; dx \qquad \text{or} \qquad \sqrt{1 + \left(\frac{dx}{dy}\right)^2}\; dy,$$


respectively; if you were using one of these forms for ds, then you would need to write the distance 
to ℓ, the “radius” r, in terms of x or y, respectively. Rather than rewrite Proposition 3.6.1 in 
every conceivable form, we shall, instead, look at four specific examples. 


We should remark at this point that, as with arc length integrals, the integrals/anti-derivatives 
that arise in trying to calculate the areas of surfaces of revolution are usually ridiculously 
difficult or impossible (to obtain as elementary functions). Of course, you can approximate the 
integrals very closely by using numerical techniques from Section 2.6. The examples that we are 
about to give are very special ones, for which we will be able to obtain exact areas. 


Example 3.6.2. You may already know a/the formula for the surface area of a sphere of radius 
R, but let’s calculate it as the area of a surface of revolution, and make sure that we get the 


well-known result. 


Consider the upper semi-circle C of radius R, centered at the origin in the xy-plane. This 
semi-circle has a simple regular parameterization given by 


$$p(t) = (x(t), y(t)) = (R\cos t,\, R\sin t), \quad \text{for } 0 \le t \le \pi,$$


and the sphere of radius R, centered at the origin, is the surface of revolution obtained by 
revolving C around the x-axis. As we are revolving around the x-axis, r = r(t) is the 
y-coordinate on C, i.e., r = R sin t. 


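We calculate 


$$ds = \sqrt{\left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2}\; dt = \sqrt{(-R\sin t)^2 + (R\cos t)^2}\; dt = R\, dt$$


and 


$$\text{surface area of a sphere of radius } R = \int_{t=0}^{t=\pi} 2\pi r\, ds = \int_0^\pi 2\pi (R\sin t)\, R\, dt = 2\pi R^2 (-\cos t) \Big|_0^\pi = 2\pi R^2 \bigl( -(-1) - (-1) \bigr) = 4\pi R^2,$$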


which is what you may have learned in high school. 
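For readers who like to check such formulas numerically, the following Python sketch approximates the integral of Proposition 3.6.1 for the semi-circle parameterization. The radius value R = 3 and all names in the code are arbitrary choices made for this illustration.

```python
import math

def midpoint_integral(f, a, b, n=20_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    width = (b - a) / n
    return width * sum(f(a + (i + 0.5) * width) for i in range(n))

R = 3.0  # sample radius, chosen arbitrarily for the check

def integrand(t):
    dx_dt = -R * math.sin(t)     # derivative of x(t) = R cos t
    dy_dt = R * math.cos(t)      # derivative of y(t) = R sin t
    radius = R * math.sin(t)     # distance from p(t) to the x-axis
    return 2 * math.pi * radius * math.hypot(dx_dt, dy_dt)

surface_area = midpoint_integral(integrand, 0.0, math.pi)
expected_area = 4 * math.pi * R**2
print(surface_area, expected_area)
```

The approximation should agree with $4\pi R^2$ to several decimal places.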


Remark 3.6.3. You may look at the formula $V = \frac{4}{3}\pi R^3$ for the volume inside a sphere of radius 
R and the formula $A = 4\pi R^2$ for the surface area of the sphere, and notice that dV/dR = A. Is 
this just a coincidence? No. 


Just as we used integrals involving cylindrical shells, thickened cylinders, to calculate the 
volumes of solids of revolution in Section 3.5, we could have used integrals involving spherical 
shells, infinitesimally thickened spheres (remember: the sphere is just the surface, not the inside), 
to calculate the volume inside a sphere. 


Let A(r) denote the surface area of a sphere of radius r. Then, consider the volume inside 
a sphere of radius R. For each r, where 0 ≤ r ≤ R, we can consider the infinitesimal volume 
of the infinitesimally thickened sphere of radius r (with the same center as the big sphere); this 
infinitesimal volume is 


$$dV = A(r)\, dr,$$


and the total volume inside the sphere of radius R is 


$$V(R) = \int_0^R A(r)\, dr.$$
Assuming that we already know that $V(R) = \frac{4}{3}\pi R^3$, we obtain that 


$$\frac{4}{3}\pi R^3 = \int_0^R A(r)\, dr.$$


Now, the first part of the Fundamental Theorem of Calculus, Theorem 2.4.7, tells us that the 
derivative, with respect to R, of the right-hand side above is simply A(R), which means that, if 
we differentiate both sides of the equality, we obtain 


$$4\pi R^2 = A(R),$$


which is what we found in Example 3.6.2. 
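The relationship dV/dR = A can also be spot-checked with a difference quotient. The sketch below is a hypothetical illustration: the sample radius, the step size h, and the function names are our own assumptions.

```python
import math

def sphere_volume(R):
    return 4.0 / 3.0 * math.pi * R**3

def sphere_area(R):
    return 4.0 * math.pi * R**2

R = 2.0    # sample radius
h = 1e-5   # small step for the central difference quotient

# Central difference approximation of dV/dR at the sample radius.
derivative_estimate = (sphere_volume(R + h) - sphere_volume(R - h)) / (2 * h)
print(derivative_estimate, sphere_area(R))
```

A difference quotient is only an approximation of the derivative, so the two printed numbers agree closely rather than exactly.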


Example 3.6.4. Let C be the portion of the graph of $y = x^2$ between x = 0 and x = 2. Revolve 
C around the y-axis and find the area of the resulting surface of revolution (which looks like 
some sort of rounded cup). See Figure 3.52. 


Figure 3.52: Graph of $y = x^2$, 0 ≤ x ≤ 2, revolved around the y-axis. 


Solution: 


We will calculate 


$$\text{surface area} = \int_{x=0}^{x=2} 2\pi r\, ds.$$


As we are given y as a function of x, we want to use that 


$$ds = \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\; dx = \sqrt{1 + (2x)^2}\; dx = \sqrt{1 + 4x^2}\; dx.$$


We also need to write r as a function of x. But, as we are revolving C around the y-axis, the 
distance to the axis of revolution, r, is simply the x-coordinate on C, i.e., in terms of x, r = x. 


Thus, the integral that we obtain is: 


$$\text{surface area} = \int_0^2 2\pi x \sqrt{1 + 4x^2}\; dx = \int_0^2 2\pi x\,(1 + 4x^2)^{1/2}\, dx.$$


Hopefully, you see fairly quickly that the substitution $u = 1 + 4x^2$ will enable us to evaluate 
the integral. We find that du = 8x dx, or du/8 = x dx. We also see that, when x = 0, 
$u = 1 + 4 \cdot 0^2 = 1$ and, when x = 2, $u = 1 + 4 \cdot 2^2 = 17$. 


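Hence, we have: 


$$\text{surface area} = 2\pi \int_1^{17} u^{1/2} \cdot \frac{1}{8}\, du = \frac{\pi}{4} \cdot \frac{u^{3/2}}{3/2} \Big|_1^{17} = \frac{\pi}{6} \left[ (17)^{3/2} - 1 \right].$$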


Example 3.6.5. Let C be the portion of the graph of $y = x^3$ between x = 0 and x = 2. Revolve 
C around the x-axis and find the area of the resulting surface of revolution. See Figure 3.53. 


Solution: 


Once again, we will calculate 


$$\text{surface area} = \int_{x=0}^{x=2} 2\pi r\, ds,$$


and, as we are again given y as a function of x, we want to use that 


$$ds = \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\; dx = \sqrt{1 + (3x^2)^2}\; dx = \sqrt{1 + 9x^4}\; dx.$$


Figure 3.53: Graph of $y = x^3$, 0 ≤ x ≤ 2, revolved around the x-axis. 


We again need to write r as a function of x. But, this time, as we are revolving C around the 
x-axis, the distance to the axis of revolution, r, is the y-coordinate on C, i.e., in terms of x, 
$r = y = x^3$. 


Thus, the integral that we obtain is: 


$$\text{surface area} = \int_0^2 2\pi x^3 \sqrt{1 + 9x^4}\; dx = \int_0^2 2\pi x^3\,(1 + 9x^4)^{1/2}\, dx.$$


The substitution $u = 1 + 9x^4$ will enable us to evaluate the integral. We find that $du = 36x^3\, dx$, 
or $du/36 = x^3\, dx$. We also see that, when x = 0, $u = 1 + 9 \cdot 0^4 = 1$ and, when x = 2, 
$u = 1 + 9 \cdot 2^4 = 145$. 


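Hence, we have: 


$$\text{surface area} = 2\pi \int_1^{145} u^{1/2} \cdot \frac{1}{36}\, du = \frac{\pi}{18} \cdot \frac{u^{3/2}}{3/2} \Big|_1^{145} = \frac{\pi}{27} \left[ (145)^{3/2} - 1 \right].$$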


Example 3.6.6. (Gabriel’s horn, revisited) In Example 3.5.14, we looked at the region B 
under the graph of y = 1/x and above the interval [1, ∞) on the x-axis, revolved B around the 
x-axis, and found that the resulting solid of revolution, Gabriel’s horn, had volume π. 


Figure 3.54: Gabriel’s Horn. 


In this example, we wish to show that the surface area of Gabriel’s horn is infinite. (When 
we write “surface area” here, we mean the area of the “sides”, i.e., we are excluding the disk that 
could fill the flared end of the horn at x = 1. However, as the surface area is infinite without 
the disk at x = 1, the surface area would certainly still be infinite if we included the disk.) 


The curve that we are revolving around the x-axis is the graph of $y = 1/x = x^{-1}$ for x ≥ 1. 
We find 


$$ds = \sqrt{1 + \left(\frac{dy}{dx}\right)^2}\; dx = \sqrt{1 + (-x^{-2})^2}\; dx = \sqrt{1 + x^{-4}}\; dx,$$


and r, the distance from a point on the graph to the x-axis, in terms of x, is given by the 
y-coordinate of the graph of y = 1/x, i.e., r = 1/x. 


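Therefore, 


$$\text{surface area} = \int_1^\infty 2\pi \cdot \frac{1}{x} \sqrt{1 + x^{-4}}\; dx = 2\pi \cdot \lim_{b\to\infty} \int_1^b \frac{1}{x} \sqrt{1 + x^{-4}}\; dx.$$


An easy substitution will not let us find an anti-derivative of $\frac{1}{x}\sqrt{1 + x^{-4}}$. However, note that, 
for x ≥ 1, 


$$\frac{1}{x} \sqrt{1 + x^{-4}} > \frac{1}{x}.$$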


Thus, Theorem 2.3.20 tells us that, if b > 1, then 


$$\int_1^b \frac{1}{x} \sqrt{1 + x^{-4}}\; dx \ge \int_1^b \frac{1}{x}\, dx = \ln x \Big|_1^b = \ln b.$$


As b goes to infinity, so does ln b, and this forces the larger quantity $\int_1^b \frac{1}{x}\sqrt{1 + x^{-4}}\, dx$ to go to 
infinity also. We conclude: 


$$\text{surface area of Gabriel's horn} = 2\pi \cdot \infty = \infty.$$
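The comparison with ln b can be illustrated numerically. In this Python sketch (an illustration under our own assumptions; the cutoffs and helper names are not from the text), the truncated integrals are approximated and checked against the lower bound ln b.

```python
import math

def midpoint_integral(f, a, b, n=200_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    width = (b - a) / n
    return width * sum(f(a + (i + 0.5) * width) for i in range(n))

def partial_area_integral(b):
    """Approximate the integral of (1/x) * sqrt(1 + x^(-4)) from 1 to b."""
    return midpoint_integral(lambda x: math.sqrt(1 + x**-4) / x, 1.0, b)

cutoffs = [10.0, 100.0, 1000.0]
partial_integrals = [partial_area_integral(b) for b in cutoffs]
log_bounds = [math.log(b) for b in cutoffs]

for b, value, bound in zip(cutoffs, partial_integrals, log_bounds):
    # The truncated surface area is 2*pi times the partial integral.
    print(b, 2 * math.pi * value, 2 * math.pi * bound)
```

Each truncated integral exceeds ln b, and the values keep growing as the cutoff increases, in contrast to the truncated volumes of the horn, which stay below π.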


Remark 3.6.7. The results of Example 3.5.14 and Example 3.6.6 tell us that Gabriel’s horn 
has finite volume, but infinite surface area. These results are sometimes described as “you can 
fill Gabriel’s horn, but you can’t paint it”. The clever student then asks “What if you filled the 
horn with paint? Wouldn’t that paint the surface?” 


This seeming contradiction is caused by a lack of precision in claiming that having infinite 
surface area means that a surface can’t be painted. What is true is that having infinite surface 
area implies that the surface cannot be painted with a finite amount of paint, if we are required 
to have a uniformly thick layer of paint everywhere. However, if it were possible to have 


arbitrarily thin layers of paint, then the surface of Gabriel’s horn could be painted. 
We'll try to describe a similar problem, in which it’s hopefully easier to see what’s going on. 


Suppose you took a cube that’s 1 foot long on each side, and you filled it with paint. Then, 
the volume of paint is finite; it’s 1 ft^3. The surface area of the cube is the combined area of the 
6 sides, namely, 6 ft^2. 


Now, imagine chopping the cube in half by a cut which is parallel to two of the faces, while 
simultaneously sealing the two new exposed sides (or, you could think of inserting dividers into 
the cube first, then chopping the cube in half). The total volume of paint in the two half-cubes 
is still 1 ft^3, but now the surface area has gone up, because we created two new faces; the surface 
area is now 6 + 2 = 8 ft^2. 


Now, by making a cut parallel to the original cut, divide (and seal) one of the two half-cubes 
from above; the volume of paint remains 1 ft^3, but we added two more faces, for a new surface 
area of 10 ft^2. Imagine continuing this process indefinitely, each time taking one of your smallest 
two pieces, and dividing it into two pieces by making a cut parallel to all of the other cuts. The 
volume of paint is always 1 ft^3, but the surface area gets arbitrarily large or, in the limit, is 
infinite. 


Is there a contradiction here? No, but note that the layer of paint on the sides of the smaller 
and smaller pieces can’t be any thicker than the width of each piece, which is getting arbitrarily 
small (close to zero). 
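The bookkeeping in the cube example can be simulated directly. The following Python sketch is a hypothetical illustration: it models each piece as a slab with 1 ft by 1 ft faces and some width, and the function names are our own.

```python
def cut_smallest(widths):
    """Split one of the thinnest slabs into two slabs of half the width."""
    ordered = sorted(widths)
    smallest = ordered[0]
    return [smallest / 2, smallest / 2] + ordered[1:]

def total_volume(widths):
    # Each slab is 1 ft by 1 ft by (width) ft.
    return sum(w * 1 * 1 for w in widths)

def total_surface_area(widths):
    # Each slab has two 1-by-1 faces and four 1-by-(width) sides.
    return sum(2 * 1 * 1 + 4 * w * 1 for w in widths)

widths = [1.0]   # start with the uncut cube
history = []     # entries: (cuts, volume, surface area, thinnest width)
for cuts in range(11):
    history.append(
        (cuts, total_volume(widths), total_surface_area(widths), min(widths))
    )
    widths = cut_smallest(widths)

for row in history:
    print(row)
```

Each cut adds 2 ft^2 of surface area while the total volume stays at 1 ft^3, and the thinnest piece halves in width each time, which is why the paint layer on its faces must become arbitrarily thin.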
3.6.1 Exercises 


Calculate the area of the surface obtained by revolving the graph of the function 
about the x-axis. Recall that the notation x ∈ [a,b] means x takes on values in this 
interval. In the first problem, for example, the portion of the graph of y = 3x + 4 
between x = 3 and x = 7 is being revolved. 


1. y = 3x + 4, x ∈ [3,7]. 


2. y=sing, x € [0,7]. 
3. y=e", x € [0,7]. 

4. y=cosha, x € [0,1]. 
5. y= Vz, x € [0,9]. 


6. y =sinha, x € [0,10]. 


Calculate the area of the surface obtained by revolving the graph of the function 


about the y-axis. 


7. y = 2x —3,0€ [4,7]. 

8. y = 427, x € [0,6]. ie 
9. y=a/3, x € [1,8]. 

10. y= 2°, x € (0, 2]. 


11. y=cosha, x € [0,4]. 


Calculate the area of the surface obtained by revolving the graph of the function 


about the given axis. 


12. y= 32+13, 2 =0, x € [—4, 0]. 


13. y= 3a+1,c=4, x € [0,4]. 
14. y= 27, y=1, x € [0,1]. 


h.ga=r eS 1 ae 0; 1), 
16. y = x - 5, y = 3, x ∈ [5,8]. 
17. y = mx + b, m ≠ 0, y = b, x ∈ [0,1]. 


Approximate the areas of the surfaces obtained by revolving the graph of the function 
about the y-axis. Use the Midpoint Rule with n = 4 evenly spaced partitions. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


y =Ing, « € [1,9]. 

y= 2? +3r+5, re [0,12]. 
y=o',x € [3.7]. 
y=a*, 2 € [4,8]. 

y = tanha, zx € [0,4]. 


ese € [0,1] 
ae ’ * 


Suppose that f(x) is a positive differentiable function. Argue that the surface area obtained 
by revolving the graph of f(x) about the x-axis on the interval [a, b] is 


$$A = 2\pi \int_a^b f(x) \sqrt{1 + [f'(x)]^2}\; dx.$$


Given the setup in the previous problem, what is the surface area obtained by revolving 
f(x) about the line y = yo? Assume f(x) > yo for all x € [a,b]. 


Suppose again that f(x) is a positive differentiable function. Argue that the surface area 
obtained by revolving the graph of f(x) about the y-axis over the interval [a, b] is 


$$A = 2\pi \int_a^b x \sqrt{1 + [f'(x)]^2}\; dx.$$


Assume a ≥ 0. 


Given the setup in the previous problem, what is the surface area obtained by revolving 
the graph of f(a) about the line x = x? Assume 2p < a. 


a. What is the surface area obtained by revolving the line y = ma about the x-axis for 
x € [0,a]? 
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b. If the surface area function you obtained in part (a) is S(m), is S a continuous 
function of m? Is it differentiable? 

29. Let a,b > 0. What is the area obtained by revolving the curve y = aVx+b about the 
x-axis for x € [0,1]? 

30. Consider the torus obtained by revolving the circle (x — 2)? + y? = 1 about the y axis. 
Calculate the area of the torus by computing the area of the upper half of the torus and 
multiplying by two. 

31. a. What is the surface area obtained by revolving the line y = mx about the y-axis for
x ∈ [0,a]?
b. If the surface area function you obtained in part (a) is S(m), is S a continuous 
function of m? Is it differentiable? 

32. Consider the horizontal line segment in the xy-plane with end points (1,3) and (2,3). 

a. Find a parameterization of the line segment. 

b. Find the area of the surface obtained by revolving the segment about the y-axis. 

c. What familiar shape is this? Check that your answer to part (b) is correct by calcu- 
lating the area using classical geometry. 

33. Consider again the horizontal line segment in the xy-plane with end points (1,3) and (2,3). 

a. Find the area of the surface obtained by revolving the segment about the y-axis. 
b. What familiar shape is this? Check that your answer to part (a) is correct by calcu- 
lating the area using classical geometry. 

34. Let C be a right circular cone obtained by revolving the line y = x about the y-axis.
Suppose that sand is poured into the cone.

a. Calculate the surface area of the sand in the cone as a function of y, the height of 
the sand in the cone. Include the circular base of the cone in your analysis. 
b. If S(y) is the surface area of the sand in the cone, what is dS/dy? 
35. Prove that the distance between a point (x_1, y_1) and the line y = mx is given by the
equation

$$d = \frac{|y_1 - m x_1|}{\sqrt{m^2 + 1}}.$$

36. Assume f(x) is a continuous function with f(x) > mx. Use the previous problem to
derive an integral equation for the area obtained by revolving the graph of the function
f(x) about the line y = mx for x ∈ [a,c].

37. Generalize the result in the previous problem. How does the equation for the surface area
change if the axis of rotation has the more general equation y = mx + b?

38. Let θ be the angle the line y = mx makes with the x-axis. What is the surface area of
the figure obtained by revolving the graph of f(x) about the line y = mx in terms of θ?
Assume x ∈ [a,c]. Hint: no need to start from scratch. Use the previous problems and
find an equation relating m and θ.

39. With the notation of the previous problem, derive formulas for the surface area in the
special cases. Assume that in all cases, the graph of f(x) lies on just one side of the axis
of rotation. Assume x ∈ [a,c].

a. θ = 0.
b. θ = π/4.
c. θ = π/2.

40. In light of these results, justify the statement "If a graph of a function lies in the first
quadrant and on one side of the line y = mx, then calculating the surface area obtained
by revolving the graph about y = mx is easy if we can calculate the area obtained by
revolving about the x-axis and y-axis."

41. What is the surface area of the region obtained by revolving the line segment y = 2x + 4,
x ∈ [3,5] about the line y = 2x?

42. Let D be the disk with radius 3 centered at the point (4,2). Then the line y = (1/2)x cuts
the disk into two equal hemispheres. Calculate the surface area of the figure formed by
revolving the hemisphere about the line y = (1/2)x.

43. Let C be the circle defined by the equation (x − 3)² + (y − 3)² = 4. What is the surface
area obtained by revolving the portion of C above the line y = x about the line y = x?

44. What is the surface area obtained by revolving the graph of the function h(x) = 4x − 2
about the line y = 2x for x ∈ [1,5]? What shape is this?


In Exercises 45-49, set up an integral to calculate the surface area of the region
obtained by rotating the graph of f(x) about the given line. Do not evaluate the
integral.


45. f(x) = e^x, x ∈ [2,5], y = 3x.

46. f(x) = cos² x, x ∈ [−1,0], y=-=.

47. xr+2
= 2,4 = 22.
f(x) ra tel ly x

48. f(x) = x² + 5x + 1, x ∈ [0,3], y = (1/3)x.

49. f(x) = √(x+1), x ∈ [3,8], y = 4x.

50. Suppose C is a piecewise simple regular curve. Formulate a reasonable equation for the
volume obtained by revolving C around an axis.
3.7 Mass and Density


You are probably familiar with the concept of density, mass per volume. However, you may
have never thought about “instantaneous density”. The issue is that objects may have different
densities at different points. The density $\delta(P)$ at a particular point P is defined by taking
“small” chunks (rectangular solids) of the object around P, looking at the mass of such a
chunk divided by its volume, and taking the limit of this process as the volume of the chunk
approaches zero. A serious discussion of this limit belongs in multivariable Calculus, but this
informal description should suffice for the applications in this section.


In terms of derivatives, we usually write that $\delta(P) = dm/dV$, i.e., that the density is the
instantaneous rate of change of the mass, with respect to volume. Or, in terms of infinitesimal
quantities and differentials, the infinitesimal amount of mass dm at P is given by
$dm = \delta(P)\,dV$, where dV represents an infinitesimal volume in the object at P. Because it
will help with the later discussion, we will borrow notation and terminology from multivariable
Calculus, and use undefined terms like “manageable type”, a “continuous function of three
variables”, and an “integral over a solid region”, so that we can state:


Proposition 3.7.1. Suppose that a density function $\delta(P)$ is continuous throughout a
manageable solid region S. Then, an infinitesimal piece of mass at a point P in S is given by
$dm = \delta(P)\,dV$, where dV is an infinitesimal volume in S around P, and the total mass of
S is the integral of $\delta$, with respect to the volume, over the solid region S, i.e.,

$$\text{mass of } S = \int_S dm = \int_S \delta(P)\,dV.$$


Okay. Great. But now your question should be: How can we have a density which varies 
in a solid object, and still deal with the situation using single-variable Calculus? Actually, the 
problem is not so bad — we just need to have situations in which the density varies only in a 
single dimension. Okay. Fine. What does this mean? 


One type of problem that we can handle is essentially what we discussed at the beginning
of Section 3.5. We suppose that we have a solid region, S, in space, and that we know the area
A(x) of every cross-sectional slice of S which is perpendicular to the x-axis, and that the solid
region lies between the x cross sections where x = a and x = b, where a < b.
Figure 3.55: An x cross section with a constant density $\delta(x)$.


Now, however, we also assume that the density of S at each point in a given x cross section
is a constant $\delta(x)$; that is, we allow the density to vary as the x cross section varies, but we
do not allow the density to vary inside the cross sections.


Proposition 3.7.2. Suppose that a < b, and we have a solid region, S, in space, which lies
between the x cross sections where x = a and x = b. Further, suppose we have a continuous
function A(x), which gives the x cross-sectional area of S, and a continuous density function
$\delta(x)$ which gives the density of S at each point in the x cross section.

Then, the infinitesimal mass dm of an infinitesimally thickened x cross section is given by

$$dm = \delta(x)\,dV = \delta(x)A(x)\,dx,$$

and

$$\text{mass of } S = \int_{x=a}^{x=b} dm = \int_a^b \delta(x)A(x)\,dx.$$


Before we begin with examples, we should discuss the units that are used for mass and weight 
in the Metric and English (a.k.a. FPS) Systems. 


The standard unit of mass in the Metric System is a kilogram, abbreviated as kg. The weight
of an object which has a mass of 1 kg is the force that gravity exerts on the mass, which is the
mass times the acceleration produced by gravity. Hence, the weight, at sea level on the Earth,
of a 1 kg object is (approximately) (1 kg)(9.8 m/s²) = 9.8 kg·m/s²; this is 9.8 Newtons, where a
Newton, abbreviated by N, is 1 kg·m/s² and is the standard unit of force in the Metric System.
If we assume that we first have Newtons, meters, and seconds, we could have defined 1 kilogram
as 1 N/(m/s²). The distinction between mass and weight in the Metric System is frequently
blurred in common speech. People often say something like “I weigh 70 kilos (kilograms)”. They
mean, of course, that their mass is 70 kg; there simply is no good verb form of the word “mass”.


You should be familiar with the standard unit of force in the English System; it’s the pound,
abbreviated lb. Just as 1 N/(m/s²) is called a kilogram, we give a name to 1 lb/(ft/s²); it is
called a slug, which is not abbreviated. The acceleration produced by gravity, using English units,
is (approximately) 32 ft/s², and so, 1 slug, at sea level on the Earth, weighs (approximately) 32
lb.
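The relationship between mass and weight in the two unit systems can be sketched in a few lines of Python. This is only an illustration: the function names are made up here, and the constants are the approximate sea-level values quoted above.

```python
# Approximate accelerations produced by gravity at sea level, as quoted in the text.
G_METRIC = 9.8    # m/s^2
G_ENGLISH = 32.0  # ft/s^2


def weight_newtons(mass_kg):
    """Weight in Newtons of a mass given in kilograms (weight = mass * g)."""
    return mass_kg * G_METRIC


def weight_pounds(mass_slugs):
    """Weight in pounds of a mass given in slugs."""
    return mass_slugs * G_ENGLISH


def mass_slugs_from_pounds(weight_lb):
    """Mass in slugs of an object whose sea-level weight is given in pounds."""
    return weight_lb / G_ENGLISH


print(weight_newtons(70.0))           # weight of a 70 kg person, in N
print(weight_pounds(1.0))             # weight of 1 slug, in lb
print(mass_slugs_from_pounds(160.0))  # mass of a 160 lb object, in slugs
```

The point of the sketch is that kilograms and slugs measure mass, while Newtons and pounds measure force; the conversion between the two always passes through the acceleration produced by gravity.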


Example 3.7.3. Consider the sphere of radius R, centered at the origin, and let S be the solid
region contained inside the upper hemisphere, as in Example 3.5.6. Assume that all distances
are measured in meters, and that the solid region S is composed of some highly compressible
material, so that the lower portion of S is more dense than the upper part. Assume that the
density of S at each point in a given z cross section is $\delta(z) = 1000(2R - z)$ kg/m³. Find the
mass m of S.


Solution: 
As we saw in Example 3.5.6, the infinitesimal volume of a thickened z cross section of S is
$dV = \pi(R^2 - z^2)\,dz$ cubic meters. The infinitesimal mass of this thickened slice of S is

$$dm = \delta(z)\,dV = 1000\pi(2R - z)(R^2 - z^2)\,dz \text{ kg},$$

and the total mass of S is

$$m = \int_{z=0}^{z=R} dm = \int_0^R 1000\pi(2R - z)(R^2 - z^2)\,dz =$$

$$1000\pi\int_0^R \left(2R^3 - R^2 z - 2Rz^2 + z^3\right)dz = 1000\pi\left(2R^3 z - R^2\cdot\frac{z^2}{2} - 2R\cdot\frac{z^3}{3} + \frac{z^4}{4}\right)\Bigg|_0^R =$$

$$1000\pi\left(\frac{13R^4}{12} - 0\right) = \frac{13{,}000\pi R^4}{12} \text{ kilograms}.$$
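As a sanity check on Example 3.7.3, the integral of $\delta(z)A(z)$ can be approximated numerically and compared with the closed form $13{,}000\pi R^4/12$. The sketch below uses a plain Midpoint Rule; the helper names and the choice R = 2 are illustrative assumptions, not part of the text.

```python
import math


def midpoint_integral(f, a, b, n=10_000):
    """Approximate the integral of f over [a, b] with the Midpoint Rule."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


def hemisphere_mass(R, n=10_000):
    """Numerically add up dm = delta(z) * A(z) dz for the upper hemisphere."""
    def dm_dz(z):
        density = 1000.0 * (2.0 * R - z)   # delta(z), in kg/m^3
        area = math.pi * (R * R - z * z)   # area of the z cross section, in m^2
        return density * area
    return midpoint_integral(dm_dz, 0.0, R, n)


def hemisphere_mass_exact(R):
    """Closed form from the example: 13,000 * pi * R^4 / 12 kilograms."""
    return 13000.0 * math.pi * R**4 / 12.0


R = 2.0  # sample radius in meters (an arbitrary choice for the check)
print(hemisphere_mass(R), hemisphere_mass_exact(R))
```

The two printed values should agree to many digits, which confirms the expansion and antiderivative used in the solution.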


There is another situation in which we can find the mass of a solid by integrating a density
function which depends on only one variable. Suppose that we have a solid of revolution, S, as
we discussed in Section 3.5, and that the density of the solid at any point P depends solely on the
distance r from the point P to the axis of revolution. Then, corresponding to finding volumes
of solids of revolutions by using cylindrical shells, we can integrate to add up the infinitesimal
amounts of mass of each cylindrical shell to produce the total mass.


For instance, suppose that B is a region located in the first and/or fourth quadrant of the
xy-plane, and we create a solid of revolution S by revolving B around the y-axis. Then, the
distance from any point P to the axis of revolution is simply the x-coordinate of P, and the
infinitesimal volume of a cylindrical shell is

$$dV = 2\pi x\,h(x)\,dx,$$

where the height function h(x) is determined by the shape of the region B. Now, if the density
at each point P in S depends only on the distance to the y-axis, then S has the same density
at each point in the cylindrical shell of radius x; this means that $\delta = \delta(x)$, and the
infinitesimal mass dm of the cylindrical shell of radius x is

$$dm = \delta(x)\,dV = 2\pi x\,h(x)\,\delta(x)\,dx.$$

Now, of course, we integrate to add up all of these infinitesimal blobs of mass.


Example 3.7.4. Recall Example 3.5.12, in which we looked at the curved triangular region B
in the xy-plane, which has part of the y-axis as its left edge, its top edge is given by y = 2 − x,
and its bottom curved edge is given by y = √x. This region lies between x = 0 and x = 1. We
revolved B around the y-axis, generating a solid of revolution S.


Assume that all distances are measured in meters, and that, if P is a point in S and the
distance from P to the y-axis is r, then the density of S at P is 200(1 + r) kg/m³. Find the
total mass of S.


Solution: 


As we saw in Example 3.5.12, the infinitesimal volume of a cylindrical shell with radius r = x
is given by

$$dV = 2\pi r h\,dx = 2\pi x\left(2 - x - x^{1/2}\right)dx \text{ cubic meters}.$$

As the radius of the shell is r = x, the density of S at each point in this shell is
$\delta(x) = 200(1 + x)$ kg/m³, and the infinitesimal mass of this shell is

$$dm = \delta(x)\,dV = 200(1 + x)\cdot 2\pi x\left(2 - x - x^{1/2}\right)dx \text{ kg}.$$

Therefore, the total mass of S, in kilograms, is given by

$$m = \int_{x=0}^{x=1} dm = \int_0^1 400\pi(1 + x)\,x\left(2 - x - x^{1/2}\right)dx.$$

To calculate this integral, you “pull out” the $400\pi$, multiply out (i.e., expand) the remaining
terms, and use the Power Rule multiple times. We leave it as an exercise for you to verify that
the result is

$$m = \frac{3340\pi}{21} \approx 499.7 \text{ kilograms}.$$
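The "expand and use the Power Rule" step in Example 3.7.4 can be checked mechanically. The sketch below is an illustration only: it expands $(1+x)\,x\,(2 - x - x^{1/2})$ by hand into the list `TERMS`, integrates each power exactly on [0, 1] with Python's `fractions` module, and cross-checks with a Midpoint Rule sum over the cylindrical shells. All names are assumptions made for this example.

```python
from fractions import Fraction
import math

# (1 + x) * x * (2 - x - x**(1/2)) expanded by hand, as (coefficient, exponent) pairs:
#   2x + x^2 - x^3 - x^(3/2) - x^(5/2)
TERMS = [
    (2, Fraction(1)),
    (1, Fraction(2)),
    (-1, Fraction(3)),
    (-1, Fraction(3, 2)),
    (-1, Fraction(5, 2)),
]

# Power Rule on [0, 1]: the integral of c * x**p is c / (p + 1).
exact_integral = sum(Fraction(c) / (p + 1) for c, p in TERMS)
exact_mass = 400 * math.pi * float(exact_integral)


def shell_mass(n=20_000):
    """Midpoint Rule sum of dm = delta(x) * 2*pi*x*h(x) dx over [0, 1]."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * h
        density = 200.0 * (1.0 + x)                              # kg/m^3
        dV = 2.0 * math.pi * x * (2.0 - x - math.sqrt(x)) * h    # shell volume
        total += density * dV
    return total


print(exact_integral)            # exact value of the integral without the 400*pi factor
print(exact_mass, shell_mass())  # closed form versus numerical approximation
```

Working with exact fractions avoids rounding in the hand computation; the numerical sum is there only as an independent check on the expansion.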


There are two other “types” of densities that are commonly used, when the object in question 
is essentially 1-dimensional, like a wire, or essentially 2-dimensional, like a thin metal plate (a 


lamina); the corresponding densities are referred to as length-density, and area-density. 
What is length-density? 


Consider a thin straight wire. For many purposes (like ours), the wire is considered as a
1-dimensional object, a line segment. Suppose that we lay out the wire along the x-axis, between
0 and L, where L is the length of the wire. Then, for each x between 0 and L, we want the
length-density, $\delta_\ell(x)$, to be the limit of the quotient obtained by taking the mass of a small
length of the wire around x, divided by that small length. Thus, by definition of $\delta_\ell(x)$, we
want the infinitesimal amount of mass dm, at x, on the wire to be given by

$$dm = \delta_\ell(x)\,dx,$$

i.e., $\delta_\ell(x)$ is defined to be dm/dx.


While it is standard to deal with length-densities as we did in the previous paragraph, it is,
of course, true that a wire is actually a 3-dimensional solid object; it’s just that two of the wire’s
dimensions are very small. If we think of the wire as having 3 dimensions, and think of usual
density, then can we make sense of what length-density means? Yes.

At each x-coordinate between 0 and L, the wire has a tiny x cross-sectional area A(x). We
suppose, as we did earlier, that, in each x cross section, the density has a constant value $\delta(x)$.
An infinitesimal piece of volume dV on the wire at x is given by dV = A(x) dx and, as each
point on the wire in the x cross section has density $\delta(x)$, the infinitesimal mass dm of this piece
is

$$dm = \delta(x)\,dV = \delta(x)A(x)\,dx.$$

Thus, if you wish to think in terms of the usual density of a 3-dimensional solid object, you
use the setup that we just discussed, and define length-density by $\delta_\ell(x) = \delta(x)A(x)$. Then,
you once again obtain that

$$dm = \delta_\ell(x)\,dx.$$


Proposition 3.7.5. Suppose that a < b, and we have an idealized 1-dimensional object
(think of a thin wire), laid out along the x-axis, lying between x = a and x = b. Further,
suppose we have a continuous length-density function $\delta_\ell(x)$ for the object. Then, the
infinitesimal mass dm of an infinitesimal portion of the object is given by $dm = \delta_\ell(x)\,dx$,
and

$$\text{mass of the object} = \int_{x=a}^{x=b} dm = \int_a^b \delta_\ell(x)\,dx.$$

Example 3.7.6. A wire has been stretched out along the x-axis. Its left end is at x = 0 and its
right end is at x = 4 feet. Suppose that the length-density of the wire is given by
$\delta_\ell(x) = e^{-x}$ slugs/ft.

What is the mass of the “left half” of the wire between x = 0 and x = 2 feet? What is the
mass of the “right half” of the wire between x = 2 and x = 4 feet? What is the x-coordinate
$x_m$ of the point (the mass midpoint) such that half of the mass of the wire lies to the left of
$x_m$ and half of the mass lies to the right?
Solution: 


The infinitesimal mass dm is simply

$$dm = \delta_\ell(x)\,dx = e^{-x}\,dx \text{ slugs}.$$

Using the substitution u = −x, it is easy to show that $\int e^{-x}\,dx = -e^{-x} + C$.

Thus, the mass $M_L$ of the left half of the wire is

$$\int_{x=0}^{x=2} dm = \int_0^2 e^{-x}\,dx = -e^{-x}\Big|_0^2 = -e^{-2} - (-e^{0}) = 1 - e^{-2} \approx 0.8646647 \text{ slugs}.$$

The mass $M_R$ of the right half of the wire is

$$\int_{x=2}^{x=4} dm = \int_2^4 e^{-x}\,dx = -e^{-x}\Big|_2^4 = -e^{-4} - (-e^{-2}) = e^{-2} - e^{-4} \approx 0.1170196 \text{ slugs}.$$

The mass midpoint, $x_m$, is the point such that

$$\int_{x=0}^{x=x_m} dm = \int_{x=x_m}^{x=4} dm,$$

that is

$$\int_0^{x_m} e^{-x}\,dx = \int_{x_m}^4 e^{-x}\,dx, \quad\text{which yields}\quad -e^{-x_m} - (-1) = -e^{-4} - (-e^{-x_m}).$$

Thus, we need

$$1 + e^{-4} = 2e^{-x_m}, \quad\text{which gives us}\quad x_m = \ln\left(\frac{2}{1 + e^{-4}}\right) \approx 0.6749973 \text{ feet}.$$
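For length-densities whose antiderivative is known but whose mass-midpoint equation cannot be solved by hand, the midpoint can be located numerically. The sketch below does this by bisection for the wire of Example 3.7.6 and compares the result with the closed form found above. The function names and the tolerance are illustrative assumptions.

```python
import math


def mass(a, b):
    """Mass (in slugs) of the wire between x = a and x = b for delta(x) = e^(-x)."""
    return math.exp(-a) - math.exp(-b)


def mass_midpoint(left, right, tol=1e-12):
    """Bisection search for the point x_m with mass(left, x_m) equal to half the total."""
    half = mass(left, right) / 2.0
    lo, hi = left, right
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if mass(left, mid) < half:
            lo = mid   # not enough mass to the left yet: move right
        else:
            hi = mid   # too much mass to the left: move left
    return (lo + hi) / 2.0


closed_form = math.log(2.0 / (1.0 + math.exp(-4.0)))

print(mass(0.0, 2.0))                        # mass of the left half
print(mass(2.0, 4.0))                        # mass of the right half
print(mass_midpoint(0.0, 4.0), closed_form)  # numerical versus exact midpoint
```

Bisection works here because the density is positive, so the mass to the left of x increases steadily as x moves to the right. Note how far to the left of the geometric midpoint x = 2 the mass midpoint lies: most of the wire's mass is concentrated near x = 0.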


Now we want to look at area-density. What is area-density? 


In a similar way to how we dealt with length-density, if we have a lamina (a plane region
which has been thickened slightly, e.g., a thin sheet of metal), then we could think of the usual
3-dimensional density, where one of the dimensions is very small. However, in analogy with
what we did for thin wires, we typically just think of the area-density, $\delta_{ar}(P)$, at each point P
on our idealized 2-dimensional plate. Thus, an infinitesimal chunk of mass dm on the plate will
be given by $dm = \delta_{ar}(P)\,dA$, where dA is an infinitesimal area. Of course, we still need to be
able to reduce this to a one-variable problem.

Therefore, if our idealized 2-dimensional region B is in the xy-plane, we will assume that the
density is constant in each cross-sectional line segment perpendicular to the x-axis (or the y-axis),
i.e., we assume that, for each fixed x, $\delta_{ar} = \delta_{ar}(x)$ and that we know the function h(x) which
is the height (length, width) of the x cross-sectional line segment in B. Then, the infinitesimal
area of a thickened x cross section, an infinitesimally thin rectangle, is dA = h(x) dx and its
infinitesimal mass is

$$dm = \delta_{ar}(x)\,dA = \delta_{ar}(x)\,h(x)\,dx.$$


Proposition 3.7.7. Suppose that a < b, and we have an idealized 2-dimensional object S
(think of a thin metal plate), in the xy-plane, lying between x = a and x = b. Further,
suppose that we have a continuous height function h(x) which gives us the height of the
x cross section of S, and a continuous area-density function $\delta_{ar}(x)$, which gives the
area-density of S at each point in the x cross section.


Then, the infinitesimal mass dm of an infinitesimally thickened x cross section of the
object is given by

$$dm = \delta_{ar}(x)\,dA = \delta_{ar}(x)\,h(x)\,dx,$$

and

$$\text{mass of the object} = \int_{x=a}^{x=b} dm = \int_a^b \delta_{ar}(x)\,h(x)\,dx.$$


Example 3.7.8. Suppose that a thin metal plate occupies the triangular region in the first
quadrant of the xy-plane below the line y = 2 − x, where all distances are measured in meters.


Figure 3.56: A triangular metal plate of varying density. 


Suppose that the area-density at each point with a given y-coordinate is constant, equal to
$\delta_{ar}(y) = e^{-y}$ kg/m². Find the mass of the plate.

Solution:


The length h(y) of an infinitesimally thin rectangle in the plate, at a given y-coordinate,
is simply the corresponding x-coordinate on the line given by y = 2 − x, i.e., h(y) = 2 − y
meters. Hence, the infinitesimal area of the infinitesimally thin rectangle at each y-coordinate
is dA = h(y) dy = (2 − y) dy. The infinitesimal mass of this rectangle is

$$dm = \delta_{ar}(y)\,dA = e^{-y}(2 - y)\,dy$$

and, thus, the total mass of the plate is

$$\text{mass of plate} = \int_{y=0}^{y=2} dm = \int_0^2 e^{-y}(2 - y)\,dy = \int_0^2 2e^{-y}\,dy - \int_0^2 ye^{-y}\,dy \text{ kg}.$$

This first integral is easy. We find

$$\int_0^2 2e^{-y}\,dy = -2e^{-y}\Big|_0^2 = -2e^{-2} - (-2e^{0}) = 2 - 2e^{-2}.$$

We need to calculate the second integral and subtract. How do you calculate $\int ye^{-y}\,dy$? By
parts (Theorem 1.1.19).


Let’s calculate the indefinite integral, and then we’ll plug in the limits of integration. Let
u = y and that leaves us with $dv = e^{-y}\,dy$. Then, du = dy, $v = \int dv = \int e^{-y}\,dy = -e^{-y}$, and
Integration by Parts tells us that

$$\int ye^{-y}\,dy = \int u\,dv = uv - \int v\,du = y(-e^{-y}) - \int (-e^{-y})\,dy = -ye^{-y} - e^{-y} + C.$$

Therefore, we obtain

$$\int_0^2 ye^{-y}\,dy = \left(-ye^{-y} - e^{-y}\right)\Big|_0^2 = -e^{-y}(y + 1)\Big|_0^2 = -e^{-2}(3) + 1 = 1 - 3e^{-2},$$

and so,

$$\text{mass of plate} = 2 - 2e^{-2} - (1 - 3e^{-2}) = 1 + e^{-2} \text{ kg}.$$
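The answer to Example 3.7.8 can be checked by adding up the masses of many thin horizontal strips, which is exactly what the integral represents. The following sketch is a rough numerical check under that reading; the function name and the number of strips are arbitrary choices.

```python
import math


def plate_mass(n=100_000):
    """Sum dm = delta_ar(y) * h(y) * dy over n thin horizontal strips, 0 <= y <= 2."""
    dy = 2.0 / n
    total = 0.0
    for i in range(n):
        y = (i + 0.5) * dy          # midpoint of the i-th strip
        area_density = math.exp(-y)  # delta_ar(y), in kg/m^2
        length = 2.0 - y             # h(y), in meters
        total += area_density * length * dy
    return total


exact = 1.0 + math.exp(-2.0)  # closed form obtained via Integration by Parts

print(plate_mass(), exact)
```

Agreement between the two printed values gives some confidence that the signs in the Integration by Parts step were handled correctly.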
3.7.1 Exercises


Throughout the exercises, assume all information is given in standard metric units.
That is, mass is given in kilograms, time in seconds and lengths in meters.


1. Suppose that S is a rectangular solid in the first octant with 0 ≤ x ≤ 3, 0 ≤ y ≤ 6,
and 0 ≤ z ≤ 5, and that the density is constant along each cross section parallel to the
xy-plane, with $\delta(z) = \cosh z$. Calculate the mass of S.


2. Suppose that S is a rectangular solid with sides parallel to the coordinate axes, where
each cross section parallel to the xy-plane has uniform density $\delta(z)$, and where a ≤ z ≤ b.
Prove that the mass is given by $m = A(z)\int_a^b \delta(z)\,dz$. That is, show that area may be
factored out of the usual equation.


3. Suppose that S is a solid right circular cone, centered along the y-axis, with vertex at the
origin, and that the base lies in the plane y = 6 and has radius 4. Suppose that each cross
section parallel to the xz-plane has density $\delta(y) = 5y$. Calculate the mass of S.


4. Generalize the result in the previous problem. Suppose S is a solid right circular cone,
centered along the y-axis, with height h, base radius r, vertex at the origin and density
function $\delta = \delta(y)$. Give an integral for the mass of S.


5. Suppose that S is the solid ball centered at the origin with radius 9, and that the cross
sections parallel to the yz-plane have density $\delta(x) = x^2 + 1$. Calculate the mass of S.


6. Suppose that S is the solid ellipsoid defined by the equation 


a? 


9 


2 


2 
Y 
2 4F 4 
apg 
and that the cross sections parallel to the xz-plane have density $\delta(y) = 2y^4 + y^2 + 1$.
Calculate the mass of S.


In each of Exercises 7 through 14, calculate the mass of the solid of revolution S.
For each problem, assume the density function, $\delta$, depends only on the distance r
from the point to the axis of revolution (as in Example 3.7.4).


7. Let S be the solid obtained by revolving the first quadrant region below the curve y = √x
and above the interval [0,4] about the y-axis, if $\delta(r) = r + 1$.

8. S is the solid cone obtained by revolving the plane region bounded by the lines y = za,
x = 0, y = 8 about the y-axis, with density function $\delta(r)$ = 1 + r°.

9. More generally, assume S is the solid cone obtained by revolving the plane region bounded
by the lines y = mx, y = y_0 > 0 and x = 0 with m > 0, about the y-axis, given a density
function $\delta(r)$. (Here, just give an appropriate integral for the mass.)

10. Consider the plane region B in the first quadrant bounded by the lines y = 5 and x = 4.
Let S be the solid obtained by revolving B about the y-axis, with density function
$\delta(r) = 2 + \cos(r^2)$.

11. More generally, assume that B is the plane region in the first quadrant bounded by the
lines y = h > 0, x = b > 0 and S is obtained by revolving B about the y-axis, given a
density function $\delta(r)$. (Here, just give an appropriate integral for the mass.)

12. Using the notation in the previous problem, what is the mass if h = 5, b = 7, but $\delta(r)$ is
changed to $e^{-r}$?

13. Suppose B is the plane region in the first quadrant below the line y = 4, with 5 ≤ x ≤ 7.
Let S be the solid obtained by revolving B about the y-axis, with $\delta(r) = \ln r$.

14. Consider the region in the first quadrant bounded by the curves y = 1.5x and y = cos πx,
with x in [0, 1/3]. Let S be the region obtained by revolving this region about the y-axis,
with $\delta(r) = r + 1$.


In each of Exercises 15 through 19, you are given the length-density function, $\delta_\ell(x)$,
of an infinitesimally thin wire lying on the x-axis over a given interval. For each
problem, calculate (a) the total mass of the wire and (b) the mass midpoint (recall
Example 3.7.6). You may need to use technology to numerically approximate the
mass midpoint.


. $\delta_\ell(x) = \sin x$, 0 ≤ x ≤ π/2.


. $\delta_\ell(x) = x\ln x$, 3 ≤ x ≤ 9.


. $\delta_\ell(x) = \cosh x$, −1 ≤ x ≤ 1.
20. Consider the following two quantities. C is the point in [a,b] such that
$\int_a^C \delta_\ell(x)\,dx = \int_C^b \delta_\ell(x)\,dx$, and $K = \frac{1}{b-a}\int_a^b \delta_\ell(x)\,dx$. Briefly describe what
C and K measure in physical terms.

21. Suppose the length-density of a wire is $\delta_\ell(x) = ce^x$ for x in [0,10]. What is the mass
midpoint?

22. Suppose the length-density of a wire is $\delta_\ell(x) = \ln(cx)$ for x in [1,c], where c > 1.
Calculate the mass midpoint in terms of c.


In each of Exercises 23 through 28, you are given a region B in the xy-plane, occupied
by a thin metal plate, and an area density function for the plate. If $\delta_{ar}$ is given as
a function of x (resp. y), then $\delta_{ar}$ gives the area-density of the plate at each point
in the x (resp. y) cross section. Calculate the mass of the metal plate.


23. Let B be the intersection of the disc centered at the origin with radius 3 and the first
quadrant, $\delta_{ar}(x) = 3x + 1$.

24. B is the intersection of the disc centered at the origin with radius 5 and the first quadrant,
daly) = f~l—y?2+1.

25. Let B be the triangular region bounded by the coordinate axes and the line y = 2x − 6,
$\delta_{ar}(x) = x^2 + 1$.

26. B is the region in the first quadrant bounded by the curves y = cos x and y = sin x, with
0 ≤ x ≤ π/4, and $\delta_{ar}(x) = x^2 + 1$.

27. B is the region in the first quadrant bounded by y = √x and x = 4, $\delta_{ar}(y)$ = y°/? + 2.

28. B is a triangle in the first quadrant with vertices (0,0), (0,a), (6,0). $\delta_{ar}(y)$ = 1/(b + 1).

29. Suppose B is a triangle in the first quadrant with vertices (0,0), (0,2), (5,0) with density
function $\delta_{ar}(x) = cx^2$. What is c if the total mass is 25 kg?


The average density of a solid region with varying density is the total mass divided
by the total volume. Use this idea to answer each of Exercises 30 through 35.


30. Suppose S is a solid region where the density function of any plane parallel to the yz-plane
is given by $\delta(x)$. Suppose further that the area of each cross section parallel to the
yz-plane is A(x) and that a ≤ x ≤ b. Show that the average density of S is given by

$$\text{average density} = \frac{\int_a^b \delta(x)A(x)\,dx}{\int_a^b A(x)\,dx}.$$

31. Calculate the average density of the solid in Exercise 3.


32. Calculate the average density of the solid in Exercise 5. 


33. Suppose that 0 < a < b, that h(x) is continuous and positive, and that B is the plane
region under the graph of h(x) and above the interval [a,b] on the x-axis. Let S be the
solid obtained by revolving B about the y-axis. Suppose further that the density of the
object, $\delta(r)$, depends only on the distance from the y-axis. Show that the average density
of the solid is

$$\text{average density} = \frac{\int_a^b 2\pi x\,h(x)\,\delta(x)\,dx}{\int_a^b 2\pi x\,h(x)\,dx}.$$


34. Calculate the average density of the solid in Exercise 7. 


35. Calculate the average density of the solid in Exercise 8. 


It’s common in physical applications for the density and mass of a region to evolve 
over time. For example, we could be considering a rectangular solid initially con- 
taining water, but where oil is added and mixed with the water. Over time, the 
density and mass of the solid will evolve. In each of Exercises 36 through 39, the 


density function is a function of time as well as position. 


36. S is a rectangular solid with sides parallel to the coordinate planes, with −2 ≤ x ≤ 4,
3 ≤ y ≤ 6, and 7 ≤ z ≤ 9. The density function $\delta(x,t)$ of a cross section parallel to the
yz-plane at time t is $\delta(x,t) = (t - 3)^2 e^x$.

a. Calculate m(t), the mass of the solid at time t. 
b. Calculate dm/dt. 
c. Assume t > 0. Is there a finite time when the mass is minimal? 
37. Suppose S is a spherical ball centered at the origin with radius 3 and that the density 


i 
function is d(r,t) = —s where r is the distance between a point in the interior of the 


ball and the origin. Assume t > 0. 


a. Calculate m(t), the mass of the ball at time t.
b. Calculate dm/dt.


38. In the previous problem, what is the average mass of the ball between times t = 2 and 
t= 5? 


39. Suppose B is the region in the first quadrant bounded by the graph of $e^x$ and y = 9. Let
S be the solid region obtained by revolving B about the y-axis. Suppose the density of an
x cross section of S is given by $\delta(x,t) = tx + t^2 + 1$. What is m(t)?


In the chapter, we saw that there was a special formula for calculating the mass of a
3-dimensional object when the density at a point depends on the distance between
the point and an axis. A similar formula exists in 2 dimensions. Suppose B is an
annulus or a disc in the xy-plane bounded by r = r_0 and r = r_1 and that the
area-density function, $\delta(r)$, depends only on the distance between a point on the surface
and the origin. Then the total mass is

$$\text{mass} = 2\pi\int_{r_0}^{r_1} r\,\delta(r)\,dr.$$

Use this formula to calculate the mass of the surface in each of Exercises 40 through
43.


40. B is a disk with radius 3 centered at the origin and with density function d(r) = 38r + 1. 


r+2 


41. B is a disk with radius 5 centered at the origin with density function d(r) = I 
r+ 


ic} 


42. B is an annulus centered at the origin bounded by circles of radii 4 and 7. d(r) = 1/r. 


43. B is an annulus centered at the origin bounded by circles of radii 2 and 6. d(r) = 4e” . 
3.8 Centers of Mass and Moments


We frequently discuss objects as though they are located at specific points. Of course, objects in 
real life exist at an infinite number of points in space. When we state that an object of mass m 
is located at a point P, we are frequently thinking of an idealized “point-mass”: an imaginary 


object which occupies a single point in space at any given time. 


However, the idealized notion of a point-mass is clearly unsatisfactory for many applications.
For instance, the magnitude of the gravitational force between objects with masses M and m,
is F = GMm/r², where G is the universal gravitational constant and r is usually described as
being the “distance between the two objects”. What is meant by this “distance”, when each
object occupies an infinite number of points in space?


The answer is that the “distance” means the distance 
between the centers of mass of the two objects. Given an 
object, or collection of objects, of (total) mass M, the center 


of mass is a point P in space such that, in many physical 


problems, the object(s) can be treated as a point-mass, with 
mass M, located at the point P. If you were trying to bal- 
ance a rigid wire or metal plate on your finger, the center of mass is where you would place your 
finger. If two children want to balance on a see-saw, the center of mass is the point where the 


base (fulcrum) of the see-saw needs to be. 


Note that the center of mass of an object need not actually be located at a point on the 
object. Perhaps the easiest example of this is a uniformly dense annulus: think of a thin metal 
disk with a smaller disk removed from its center; by symmetry, the center of mass is located at 
the center of the annulus, but that part has been removed. What this means is that there is no 
place where you could place your finger in order to balance an annulus. Hopefully, this seems 


intuitively clear. 


For a solid object, the center of mass can be determined, in principle, via integration, from
the shape and density function (Section 3.7) of the object. For an object with constant density,
the center of mass is also known as the centroid of the object.


In order to discuss how integration can be used to find centers of mass of objects, we first 
need to understand what happens for a finite number of point-masses. We shall describe this, 


and then define the center of mass. 


The discussion of the center of mass is most natural using the language of vectors, which we
looked at in Section 3.3 and for which there’s a quick summary in Appendix A. We shall use
vectors now, but we shall also describe the situation later for wires and thin plates, and there we
shall not refer to vectors. Our treatment follows that in the excellent volumes of The Feynman
Lectures on Physics, [1].


Suppose that we have $n$ point-masses, whose masses are constants $m_1, \dots, m_n$ (not all zero),
and the masses are moving in space (or in a plane, or in a line). Let $\vec r_i = \vec r_i(t)$ denote the
position of the mass $m_i$, for $i = 1, \dots, n$, at time $t$. (In other places, we have used $\vec p$ to denote
position; it is standard in our current situation to use $\vec r$, for radial vector.)


Suppose now that we have a force $\vec F_i = \vec F_i(t)$ acting on each mass $m_i$, and that $\vec F_i$ is the only
force acting on $m_i$. Then, the vector form of Newton’s 2nd Law of Motion tells us that the net
force acting on an object (of constant mass) is the mass times the acceleration of the object,
i.e.,

$$\vec F_i = m_i\,\frac{d^2\vec r_i}{dt^2}.$$

Now suppose that we think of all of the masses together as one object, possibly because
they’re very close together, but possibly not. Then, the total mass of the new “object” is
$M = \sum_{i=1}^n m_i$, and the net force acting on the object is $\vec F = \sum_{i=1}^n \vec F_i$. We want to be able to
apply Newton’s 2nd Law to the collective mass $M$; that is, we want $\vec F$ to equal the mass
$M$ times the acceleration. But what is this “acceleration” the acceleration of? Acceleration is,
of course, the second derivative of position, with respect to time, but what position do we take
the second derivative of?


Using that the masses $m_i$ are constant, so that $M$ is also constant, we have

$$\vec F = \sum_{i=1}^n \vec F_i = \sum_{i=1}^n \frac{d^2(m_i\vec r_i)}{dt^2} = M\,\frac{d^2}{dt^2}\left[\left(\sum_{i=1}^n m_i\vec r_i\right)\Big/ M\right].$$


Therefore, if we define a position vector

$$\vec r_{cm} = \vec r_{cm}(t) = \frac{\sum_{i=1}^n m_i\vec r_i}{M} = \frac{\sum_{i=1}^n m_i\vec r_i}{\sum_{i=1}^n m_i},$$

then we obtain that

$$\vec F = M\,\frac{d^2\vec r_{cm}}{dt^2}. \tag{3.2}$$

Hence, we make the following definition:


Definition 3.8.1. The center of mass of a collection/system of objects with masses
$m_1, \dots, m_n$ at positions $\vec r_1, \dots, \vec r_n$, respectively, is the point

$$\vec r_{cm} = \frac{\sum_{i=1}^n m_i\vec r_i}{\sum_{i=1}^n m_i}.$$


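If we let $\vec r_i = (x_i, y_i, z_i)$, then we find that

$$x\text{-coordinate of the center of mass} = \left(\sum_{i=1}^n m_i x_i\right) \Big/ \left(\sum_{i=1}^n m_i\right),$$

$$y\text{-coordinate of the center of mass} = \left(\sum_{i=1}^n m_i y_i\right) \Big/ \left(\sum_{i=1}^n m_i\right),$$

$$z\text{-coordinate of the center of mass} = \left(\sum_{i=1}^n m_i z_i\right) \Big/ \left(\sum_{i=1}^n m_i\right).$$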


The quantity $M = \sum_{i=1}^n m_i$ is the total mass of the system. The quantities $\sum_{i=1}^n m_i x_i$,
$\sum_{i=1}^n m_i y_i$, and $\sum_{i=1}^n m_i z_i$ are referred to by various terms involving the word moment;
we shall refer to them as the $x$-, $y$-, and $z$-components, respectively, of the mass-moment
about the origin. In fact, as we shall consider mass-moments about only
the origin, we will usually write simply the $x$-, $y$-, and $z$-components of the moment.


Different sources also use different notations for the components of the moment. We
shall try to avoid confusion; if $M$ is the total mass, then we write $M\bar x$, $M\bar y$, and $M\bar z$ for the
$x$-, $y$-, and $z$-components of the moment, respectively.

Remark 3.8.2. Note that if we replaced $\vec r_{cm}$ in Formula 3.2 with $\vec r = \vec r_{cm} + t\vec v_0 + \vec r_0$, where
$\vec v_0$ and $\vec r_0$ are constant vectors, then the equation would still hold since the 2nd derivative of
$t\vec v_0 + \vec r_0$, with respect to $t$, is zero. However, when there is only one non-zero mass, i.e., when
$m_{i_0} \neq 0$ for exactly one index $i_0$, we, of course, want the center of mass to be where that non-zero
mass is, namely, at $\vec r_{i_0}$. Since we want this to be true at all times, and regardless of which
mass is non-zero, we are required to choose $\vec v_0 = \vec 0$ and $\vec r_0 = \vec 0$, i.e., to define the center of mass
as we did.
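The definition for finitely many point-masses translates directly into a few lines of code. The following is a minimal sketch, not part of the original text; the function name `center_of_mass` and the sample masses and positions are hypothetical, chosen only for illustration.

```python
def center_of_mass(masses, positions):
    """Return (total mass, moment components, center of mass) for point-masses.

    masses    -- list of numbers m_i (not all zero)
    positions -- list of coordinate tuples r_i, all of the same dimension
    """
    if len(masses) != len(positions):
        raise ValueError("need exactly one position per mass")
    total_mass = sum(masses)
    if total_mass == 0:
        raise ValueError("total mass must be non-zero")
    dim = len(positions[0])
    # k-th component of the moment about the origin: sum of m_i * (k-th coordinate of r_i)
    moment = tuple(
        sum(m * p[k] for m, p in zip(masses, positions)) for k in range(dim)
    )
    # center of mass = moment / total mass, component by component
    center = tuple(component / total_mass for component in moment)
    return total_mass, moment, center


# Hypothetical sample data: three point-masses in the plane.
masses = [2.0, 1.0, 1.0]
positions = [(0.0, 0.0), (4.0, 0.0), (0.0, 4.0)]
M, moment, center = center_of_mass(masses, positions)
print(M, moment, center)
```

With this sample data, the moment components are both 4 and the total mass is 4, so the center of mass sits at (1, 1), a point where no mass is actually located.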


Suppose now that we have a thin wire, of (possibly) variable length-density $\delta_\ell(x)$, laid out
along the $x$-axis; see Section 3.7. Then, its center of mass has only an $x$-coordinate. How do we
find it?

It’s easy, now that we know what to do for a finite number of point-masses. We think of
chopping the wire up into a large (but finite) number of small pieces, taking approximating
Riemann sums, and taking a limit. Of course, we describe the result infinitesimally instead.
Around the point at a given $x$ value, an infinitesimal chunk of mass $dm$ contributes $dm$ to the
total mass and $x\,dm = x\,\delta_\ell(x)\,dx$ to the $x$-component of the moment. What we conclude is:


Proposition 3.8.3. Suppose that we have an idealized 1-dimensional object, laid out along
the $x$-axis, between $x = a$ and $x = b$, with a continuous length-density function $\delta_\ell(x)$.

Then, the $x$-component of the moment is

$$\int_{x=a}^{x=b} x\,dm = \int_a^b x\,\delta_\ell(x)\,dx,$$

the total mass is

$$M = \int_{x=a}^{x=b} dm = \int_a^b \delta_\ell(x)\,dx,$$

and the center of mass is located at the $x$-coordinate given by

$$\bar x = \frac{\int_{x=a}^{x=b} x\,dm}{M} = \frac{\int_a^b x\,\delta_\ell(x)\,dx}{\int_a^b \delta_\ell(x)\,dx}.$$

Remark 3.8.4. Suppose you have a thin wire with a constant (positive) length-density. Where
do you intuitively believe that the center of mass is located? Hopefully, it seems clear that it
should be the midpoint of the wire. Let’s make certain that that’s what we get from the formula
in Proposition 3.8.3.

Suppose that $\delta_\ell(x) = \delta_\ell > 0$ is a constant, and that the wire lies between $x = a$ and $x = b$,
where $a < b$. Then,

$$\bar x = \frac{\int_{x=a}^{x=b} x\,dm}{M} = \frac{\int_a^b x\,\delta_\ell\,dx}{\int_a^b \delta_\ell\,dx} = \frac{\delta_\ell\int_a^b x\,dx}{\delta_\ell\int_a^b dx} = \frac{\frac{x^2}{2}\Big|_a^b}{x\Big|_a^b} = \frac{\frac{b^2}{2} - \frac{a^2}{2}}{b - a} = \frac{b + a}{2},$$

which is the midpoint of the wire, as we expected.


Recall that the center of mass of an object with constant density, as above, is called the 
centroid. What we have just seen, in the case of a wire, is that the constant density cancels 
out in the calculation; so that the location of the centroid is independent of what the constant 
density actually is. Hence, the centroid depends only on the shape of the object. As we shall 


see, this is also true for thin metal plates, and solid objects. 


It is important to note that, in general, you cannot take the fraction

$$\frac{\int_a^b x\,\delta_\ell(x)\,dx}{\int_a^b \delta_\ell(x)\,dx}$$

and somehow cancel the $\delta_\ell(x)$ in the integral in the numerator with the $\delta_\ell(x)$ in the integral
in the denominator. The cancellation works for constant density only because, then, we may factor the
density out of the integrals.


Example 3.8.5. Let’s look again at the wire from Example 3.7.6. The wire was stretched out
along the $x$-axis. Its left end was at $x = 0$ and its right end was at $x = 4$ feet. The length-density
of the wire was given by $\delta_\ell(x) = e^{-x}$ slugs/ft.

We found that the $x$-coordinate $x_m$ of the point such that half of the mass of the wire lies to
the left of $x_m$ and half of the mass lies to the right was $x_m = \ln\left(\frac{2}{1 + e^{-4}}\right) \approx 0.6749973$ feet.
Is this the same as the center of mass? No. The calculation of $x_m$ did not take into account the
actual $x$-coordinates of the infinitesimal chunks of mass. Let’s calculate the center of mass and
see what we actually get.

Back in Example 3.7.6, we already calculated the masses of the left and right halves of the
wire; they were/are $M_L = 1 - e^{-2}$ slugs and $M_R = e^{-2} - e^{-4}$ slugs. Thus, the total mass of the
wire is

$$M = M_L + M_R = 1 - e^{-4} \text{ slugs}.$$


The $x$-component of the moment is

$$M\bar x = \int_{x=0}^{x=4} x\,dm = \int_0^4 x\,\delta_\ell(x)\,dx = \int_0^4 xe^{-x}\,dx.$$


This integral is evaluated using integration by parts. We actually already calculated this indef- 
inite integral (with y’s in place of x’s) back in Example where we found that 


$$\int ye^{-y}\,dy = \int u\,dv = uv - \int v\,du = y(-e^{-y}) - \int -e^{-y}\,dy = -ye^{-y} - e^{-y} + C.$$


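Therefore,

$$M\bar x = \left(-xe^{-x} - e^{-x}\right)\Big|_0^4 = -4e^{-4} - e^{-4} + 1 = 1 - 5e^{-4}.$$

Dividing this by the total mass, we find that

$$\bar x = \frac{1 - 5e^{-4}}{1 - e^{-4}} \approx 0.925370558545 \text{ feet}.$$

Notice that this is about 37% larger than $x_m$, the “mid-mass” point.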
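The numbers in this example are easy to check numerically. The sketch below is illustrative only: it approximates the integrals in Proposition 3.8.3 with a midpoint Riemann sum, and the helper names `midpoint_integral` and `wire_center_of_mass` are hypothetical, not from the text.

```python
import math


def midpoint_integral(f, a, b, n=20_000):
    """Approximate the integral of f over [a, b] with n midpoint rectangles."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


def wire_center_of_mass(density, a, b, n=20_000):
    """Return (mass, x-component of the moment, x-bar) for a wire on [a, b]."""
    mass = midpoint_integral(density, a, b, n)
    moment = midpoint_integral(lambda x: x * density(x), a, b, n)
    return mass, moment, moment / mass


# The wire of Example 3.8.5: length-density e^(-x) slugs/ft on [0, 4].
mass, moment, x_bar = wire_center_of_mass(lambda x: math.exp(-x), 0.0, 4.0)

# Closed-form values, for comparison.
exact_x_bar = (1 - 5 * math.exp(-4)) / (1 - math.exp(-4))
x_mid_mass = math.log(2 / (1 + math.exp(-4)))

print(f"numerical x_bar = {x_bar:.9f}, exact x_bar = {exact_x_bar:.9f}")
print(f"mid-mass point  = {x_mid_mass:.7f}")
```

The Riemann-sum value agrees with the closed form to many decimal places, and both are clearly different from the mid-mass point.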


Now, consider a thin metal plate, lying in the $xy$-plane. As in Section 3.7, we will assume
that we have an area-density function for the plate which depends on a single variable, say $x$.
That is, we assume that our plate lies between the vertical lines $x = a$ and $x = b$, and that we
have a continuous area-density function $\delta_{ar}(x)$, for $a \leq x \leq b$, which gives the area density of
our object at each point in the $x$ cross section. We assume that we have a continuous function
$h(x)$ which gives the height/length of the $x$ cross section of the object.

Then, we know that an infinitesimally wide vertical slab of mass is given by $dm = \delta_{ar}(x)\,dA =
\delta_{ar}(x)h(x)\,dx$. In addition, the contribution of this slab of mass to the $x$-component of the
moment is easy; it’s $x\,dm = x\,\delta_{ar}(x)h(x)\,dx$.


The slightly difficult part is the contribution of this vertical slab of mass to the $y$-component
of the moment. We would like to say that it’s $y\,dm$, but there are an infinite number of $y$
coordinates in each vertical slab. So what $y$-coordinate do we use? Actually, that’s easy; we use
the $y$-coordinate $\bar y_x$ of the center of mass of the infinitesimal slab given by thickening the $x$ cross
section. Since the density is constant in a vertical slab, $\bar y_x$ is simply the $y$-coordinate of the
midpoint of the $x$ cross section, which is one half the sum of the lower and upper $y$-coordinates
of the cross section.


Therefore, we obtain: 


Proposition 3.8.6. Suppose that $a < b$, and we have an idealized 2-dimensional object $S$
(think of a thin metal plate), in the $xy$-plane, lying between the lines $x = a$ and $x = b$, under
the graph of $y = f(x)$ and above the graph of $y = g(x)$, where $f(x)$ and $g(x)$ are continuous
functions and $f(x) \geq g(x)$ on the interval $[a,b]$. Further, suppose that we have a continuous
area-density function $\delta_{ar}(x)$, which gives the area-density of $S$ at each point in the $x$ cross
section.

Then, the total mass of the object is

$$M = \int_{x=a}^{x=b} dm = \int_a^b \delta_{ar}(x)h(x)\,dx = \int_a^b \delta_{ar}(x)\big(f(x) - g(x)\big)\,dx.$$

The $x$-component of the moment is

$$M\bar x = \int_{x=a}^{x=b} x\,dm = \int_a^b x\,\delta_{ar}(x)\big(f(x) - g(x)\big)\,dx.$$

The $y$-component of the moment is

$$M\bar y = \int_{x=a}^{x=b} \bar y_x\,dm = \int_a^b \frac{f(x) + g(x)}{2}\,\delta_{ar}(x)\big(f(x) - g(x)\big)\,dx.$$


Example 3.8.7. Suppose that we have a thin metal plate, occupying the region in the $xy$-plane
which is bounded on the top and left by the graph of $y = x^2$, on the right by the line given by
$x = 2$, and on the bottom by the line given by $y = 1$. Suppose that the area density function is
$\delta_{ar}(x) = x$ kg/m$^2$.


Figure 3.57: A thin metal plate of variable density. 


Find the center of mass of the plate. 


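Solution:

First, let’s find an expression for the infinitesimal mass $dm$ of an infinitesimally thin rectangle
at a given $x$ value. We have

$$dm = \delta_{ar}(x)\,dA = x(x^2 - 1)\,dx = (x^3 - x)\,dx.$$

The contribution to the $x$-component of the moment is

$$x\,dm = x\cdot(x^3 - x)\,dx = (x^4 - x^2)\,dx,$$

and the contribution to the $y$-component of the moment is

$$\bar y_x\,dm = \frac{x^2 + 1}{2}\cdot x(x^2 - 1)\,dx = \frac{1}{2}\,x(x^4 - 1)\,dx = \frac{1}{2}(x^5 - x)\,dx.$$

Thus, we find that the mass is

$$M = \int_1^2 (x^3 - x)\,dx = \left(\frac{x^4}{4} - \frac{x^2}{2}\right)\Big|_1^2 = \frac{9}{4} \text{ kg},$$

$$\bar x = \frac{\int_1^2 (x^4 - x^2)\,dx}{M} = \frac{58/15}{9/4} = \frac{232}{135} \approx 1.7185 \text{ m},$$

and

$$\bar y = \frac{\frac{1}{2}\int_1^2 (x^5 - x)\,dx}{M} = \frac{9/2}{9/4} = 2 \text{ m}.$$

Figure 3.58: Center of mass indicated in green.

In Figure 3.58, we have indicated the center of mass in green.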
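As a sanity check on this kind of plate calculation, the three integrals in Proposition 3.8.6 can be approximated numerically. The following is a minimal sketch under stated assumptions: it uses a midpoint Riemann sum, the helper names `midpoint_integral` and `plate_center_of_mass` are hypothetical, and the plate data (top $y = x^2$, bottom $y = 1$, $1 \leq x \leq 2$, density $x$) is the reading of Example 3.8.7 used above.

```python
def midpoint_integral(f, a, b, n=20_000):
    """Approximate the integral of f over [a, b] with n midpoint rectangles."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


def plate_center_of_mass(density, top, bottom, a, b, n=20_000):
    """Return (mass, (x_bar, y_bar)) for a plate between y=bottom(x) and y=top(x)."""
    # dm = density * (height of the cross section) dx
    dm = lambda x: density(x) * (top(x) - bottom(x))
    mass = midpoint_integral(dm, a, b, n)
    moment_x = midpoint_integral(lambda x: x * dm(x), a, b, n)
    # each vertical slab has its own center of mass at the midpoint of the cross section
    moment_y = midpoint_integral(lambda x: 0.5 * (top(x) + bottom(x)) * dm(x), a, b, n)
    return mass, (moment_x / mass, moment_y / mass)


# Plate of Example 3.8.7 (assumed data: density x, top x^2, bottom 1, 1 <= x <= 2).
mass, (x_bar, y_bar) = plate_center_of_mass(
    density=lambda x: x,
    top=lambda x: x**2,
    bottom=lambda x: 1.0,
    a=1.0,
    b=2.0,
)
print(f"M = {mass:.6f}, x_bar = {x_bar:.6f}, y_bar = {y_bar:.6f}")
```

The same function handles constant-density regions: passing `density=lambda x: 1.0` returns the centroid, since the constant cancels in the quotients.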


Example 3.8.8. Consider the triangular region in the first quadrant, under the line given by
$y = kx$, where $k > 0$, and above the interval $[0, b]$, where $b > 0$. Find the centroid of this region.


Figure 3.59: A triangular metal plate of constant area-density. 


Solution:

As we are finding a centroid, we assume that the area-density $\delta_{ar} > 0$ is a constant. The
infinitesimal mass $dm$ of the infinitesimally thin rectangle at $x$ is given by

$$dm = \delta_{ar}\,dA = \delta_{ar}kx\,dx.$$

The infinitesimal contribution from this rectangle to the $x$-component of the moment is

$$x\,dm = x\,\delta_{ar}kx\,dx = \delta_{ar}kx^2\,dx,$$

and the infinitesimal contribution from this rectangle to the $y$-component of the moment is

$$\bar y_x\,dm = \frac{kx}{2}\cdot\delta_{ar}kx\,dx = \frac{\delta_{ar}k^2}{2}\,x^2\,dx.$$


The mass of the plate is

$$\text{mass} = \int_{x=0}^{x=b} \delta_{ar}\,dA = \delta_{ar}\int_{x=0}^{x=b} dA = \delta_{ar}\cdot(\text{area of the plate}).$$

Of course, when the area-density is constant, it’s always true that the mass of the plate is simply
$\delta_{ar}$ times $A$, the area of the plate. For a more-complicated region, we would have to integrate
to find the area and, even here, we could integrate $kx$ to find the area of the triangle, but it’s
a triangle; its area is 1/2 the base times the height, i.e., $A = (1/2)b(kb) = kb^2/2$. Hence, we
have

$$M = \frac{\delta_{ar}kb^2}{2}.$$


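The $x$-component of the moment is

$$M\bar x = \int_{x=0}^{x=b} x\,dm = \int_0^b \delta_{ar}kx^2\,dx = \frac{\delta_{ar}kb^3}{3},$$

and the $y$-component of the moment is

$$M\bar y = \int_{x=0}^{x=b} \bar y_x\,dm = \int_0^b \frac{\delta_{ar}k^2}{2}\,x^2\,dx = \frac{\delta_{ar}k^2b^3}{6}.$$

Thus, we find

$$\bar x = \frac{\delta_{ar}kb^3/3}{\delta_{ar}kb^2/2} = \frac{2b}{3}$$

and

$$\bar y = \frac{\delta_{ar}k^2b^3/6}{\delta_{ar}kb^2/2} = \frac{kb}{3}.$$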


If we let $P$ denote the vertex at the right angle, i.e., the point $(b,0)$, then the centroid is
located at the point with $x$-coordinate which is 1/3 of the way along the base from $P$, and with
$y$-coordinate which is 1/3 of the height of the triangle away from $P$.


Figure 3.60: The centroid is indicated in green. 


Now we would like to look at one example involving the center of mass (actually, here, the 
centroid) of a solid region. 


Example 3.8.9. Recall the general definition of a cone from Definition Consider, as we 
did just before Proposition 3.5.5, the cone whose base is a region contained in the plane at the
fixed $z$-coordinate $z = H > 0$, with vertex at the origin.

By looking at the case where the plane region was a rectangle, as in Example 3.5.3, we can/did
conclude that the cross-sectional area of the cone at a given $z$-coordinate is proportional to $z^2$,
i.e., there is a constant $k$ such that $A(z) = kz^2$.

Without knowing more about the base of the cone, it is not possible to find the $x$- and
$y$-coordinates of the centroid. However, we can easily find the $z$-coordinate.

We assume that the density is a constant $\delta > 0$. An infinitesimal slab of mass $dm$ is obtained
by taking the area of the $z$ cross section and multiplying it by an infinitesimal thickness $dz$.
Hence,

$$dm = \delta\,dV = \delta A(z)\,dz = \delta kz^2\,dz.$$

The contribution to the $z$-component of the moment of this slab is

$$z\,dm = z\cdot\delta kz^2\,dz = \delta kz^3\,dz.$$

Therefore, we find

$$\bar z = \frac{\int_{z=0}^{z=H} z\,dm}{\int_{z=0}^{z=H} dm} = \frac{\int_0^H \delta kz^3\,dz}{\int_0^H \delta kz^2\,dz} = \frac{\delta kH^4/4}{\delta kH^3/3} = \frac{3H}{4}.$$

Note that this is independent of the area or shape of the base of the cone and, of course,
independent of $\delta$.


Figure 3.61: A cone whose base is a general plane region. 


What would we need to know in order to find the $x$- and $y$-coordinates of the centroid? We
would need for each $z$ cross section to be very symmetric, let’s say they’re circular disks, and
would need to know the $x$- and $y$-coordinates, $a(z)$ and $b(z)$, respectively, of the center of each
$z$ cross-sectional disk, for these coordinates are clearly the coordinates of the centroid of each
disk itself. In analogy with what we wrote for thin metal plates, we write $\bar x_z$ and $\bar y_z$ for $a(z)$
and $b(z)$, respectively.

We would then have

$$\bar x = \frac{\int_{z=0}^{z=H} \bar x_z\,dm}{\int_{z=0}^{z=H} dm} = \frac{\int_0^H \bar x_z\,\delta kz^2\,dz}{\int_0^H \delta kz^2\,dz} = \frac{\int_0^H \bar x_z z^2\,dz}{\int_0^H z^2\,dz} = \frac{\int_0^H \bar x_z z^2\,dz}{H^3/3}$$

and

$$\bar y = \frac{\int_{z=0}^{z=H} \bar y_z\,dm}{\int_{z=0}^{z=H} dm} = \frac{\int_0^H \bar y_z\,\delta kz^2\,dz}{\int_0^H \delta kz^2\,dz} = \frac{\int_0^H \bar y_z z^2\,dz}{\int_0^H z^2\,dz} = \frac{\int_0^H \bar y_z z^2\,dz}{H^3/3}.$$

The simplest, standard case would be that of a right circular cone, centered along the
$z$-axis. Then, $\bar x_z = 0$ and $\bar y_z = 0$, and the above formulas yield that $\bar x$ and $\bar y$ are both zero.
Good; that’s what we would automatically conclude by appealing to the symmetry of the right
circular cone.
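The claim that the height of the centroid is $3H/4$, regardless of the constant $k$ in $A(z) = kz^2$, can be checked numerically. This is only an illustrative sketch: the function name `cone_centroid_height` and the sample values of `H` and `k` are hypothetical, and the integrals are approximated with a midpoint Riemann sum.

```python
def midpoint_integral(f, a, b, n=20_000):
    """Approximate the integral of f over [a, b] with n midpoint rectangles."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


def cone_centroid_height(H, k, density=1.0):
    """z-coordinate of the centroid of a cone with vertex at the origin,
    base in the plane z = H, and cross-sectional area A(z) = k * z**2."""
    dm = lambda z: density * k * z**2          # mass of the slab at height z, per unit dz
    mass = midpoint_integral(dm, 0.0, H)
    moment_z = midpoint_integral(lambda z: z * dm(z), 0.0, H)
    return moment_z / mass


H = 2.0
for k in (0.5, 1.0, 3.5):                      # different base areas, same height
    print(k, cone_centroid_height(H, k))
```

Each value printed is approximately $3H/4 = 1.5$; changing `k` or `density` rescales the numerator and denominator by the same factor.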


3.8.1 Exercises 


In Exercises 1 - 5 you are given the position vectors of a collection of objects in 
two or three dimensions along with the mass of each object. Calculate the center of 
mass of each system. For each of the two dimensional systems, make a plot showing 


the position of the objects and the center of mass. 


1.  $\vec r_i$ | $m_i$
    (0,0) | 5
    (0,1) | 10
    (1,0) | 4

2.  $\vec r_i$ | $m_i$
    (-1,0) | 3
    (0,1) | 6
    (1,0) | 4
    (0,-1) | 9

3.  $\vec r_i$ | $m_i$
    (1,1) | 1
    (2,2) | 2
    (3,3) | 3
    (4,4) | 4

0)

1) | 15 
,0) | 12 
0) 


3 
= 


on 

—> 

=) 

a 

So 

mS 
oInw fb wy 


(0,0,-1) | 7 


6. Consider a collection of five objects. Four of the objects lie at the corners of a square and 
have mass m. The fifth object lies at the geometric center of the square and has mass k. 
What is the center of mass of the system? 


In Exercises 7 - 12, you are given the length-density function $\delta_\ell(x)$ of an idealized
1-dimensional object that lies along the $x$-axis. For each problem, calculate (a) the
$x$-component of the moment and (b) the $x$-coordinate of the center of mass.

7. $\delta_\ell(x) = 3x^2 + 2x + 4$, $x$ in $[2,7]$.


8. dg(%) = cosa, x in >: “I. g 


2x+1 
9. be(x) — 37 2 


10. $\delta_\ell(x) = x\ln x$, $x$ in $[3,6]$.


, @ in [4,12]. 


11. d¢(a) = 5%, x in [2,3]. 
1 


x 


13. Explain why an area-density function of $\delta_{ar}(x) = x^2 - 3$ is valid on the $x$-interval $[2,5]$ but
not on the interval $[1,4]$.

14. Consider an annulus centered at the origin with outer radius 2 and inner radius 1. Suppose
the top half of the annulus is made of a material with uniform density $\delta$ and that the lower
half of the annulus is made of a second material with uniform density $k\delta$. What is the
center of mass?


In Exercises 15 - 20, you are given an idealized 2-dimensional object $S$ in the
$xy$-plane and the area-density function $\delta_{ar}(x)$ of the object, which gives the area-density
of $S$ at each point in the $x$ cross section. Find (a) $M$, (b) $M\bar x$, (c) $M\bar y$,
(d) $\bar x$, and (e) $\bar y$.

15. $S$ is bounded by the curves $f(x) = x$ and $g(x) = x^2$ between $x = 0$ and $x = 1$. $\delta_{ar}(x) = 2x + 1$.

16. $S$ is a rectangle with vertices $(1,1)$, $(1,5)$, $(6,5)$, and $(6,1)$. $\delta_{ar}(x) = e^x$.

17. $S$ is the region bounded by the curves $f(x) = \sqrt{4 - x^2}$ and $g(x) = \sqrt{1 - x^2}$, $x = -1/2$,
$x = 1/2$, and $\delta_{ar}(x) = x^2 + 1$.

18. $S$ is a triangular region with vertices $(0,0)$, $(1,1)$ and $(-1,-1)$. $\delta_{ar}(x) = 2 - x^2$.

19. $S$ is bounded by the lines $f(x) = 3x + 5$, $g(x) = x - 3$, $x = -4$ and $x = 3$. $\delta_{ar}(x) = x^2 + 5$.

20. $S$ is the region below the curve $f(x) = \cos x$ and above the line $g(x) = 1/2$, and $x$ is
between $-\pi/3$ and $\pi/3$. $\delta_{ar}(x) = |x| + 1$.

21. Find the centroid of the region occupied by the metal plate in Example 

22. What is the centroid of a right circular cone where the base lies in the plane z = 9 and 
the vertex is the origin? 

23. Consider a cone of uniform density which has a circular disc as its base. The base lies in 
the plane $z = 1$, has center $(1,1,1)$ and radius 1. The cone is the union of all line segments
between the origin and points on the disc. What is the centroid of the cone? 

24. Must there be any mass at the center of mass of a system? If not, give an example. 

25. Suppose $f(x)$ is a continuous function on the interval $[a,b]$ and that $f(x) \geq 0$. Let $S$ be
the region lying below the graph of $f(x)$, above the $x$-axis, and between $x = a$ and $x = b$,
and suppose $S$ has constant density $\delta$. Show that the center of mass, or centroid, has
coordinates

$$\bar x = \frac{1}{A}\int_a^b xf(x)\,dx \quad\text{and}\quad \bar y = \frac{1}{2A}\int_a^b f(x)^2\,dx,$$

where $A$ is the area of $S$.

Use the previous problem to find the centroid of the regions in Exercises 26-31.
Assume the density is constant and that each region lies below the graph of $f(x)$,
above the $x$-axis and between $x = a$ and $x = b$.


26. $f(x) = x^2 + 4$, $x = 3$, $x = 7$.

27. $f(x) = \sin x$, $x = 0$, $x = \pi$.

28. $f(x) = \cosh x$, $x = -1$, $x = 1$.

29. $f(x) = mx$ where $m > 0$, $x = 0$, $x = b$.

30. $f(x) = \sqrt{16 - x^2}$, $x = -4$, $x = 4$.

31. $f(x) = e^x$, $x = 0$, $x = 1$.

32. Suppose $f$ and $g$ are two continuous functions on the interval $[a,b]$ and that $f \geq g$. Let
$S$ be the region lying between the graphs of the two functions and bounded by $x = a$
and $x = b$. Assume $S$ has constant density $\delta$ and prove the centroid of $S$ has coordinates

$$\bar x = \frac{1}{A}\int_a^b x\big(f(x) - g(x)\big)\,dx \quad\text{and}\quad \bar y = \frac{1}{2A}\int_a^b \big(f(x)^2 - g(x)^2\big)\,dx.$$


Use the previous problem to find the centroid of the regions in Exercises 33 - 38 


bounded by the graphs of f(x) and g(x), and by « = a and x = b. Assume in each 


case the density is constant. 


33. 


38. 


f(a) = 2049, g(a) =244+5,27=4, 2 =8. 

. fe) =9, oa) =a", eo = +3, = 8. 

f(z) = 64 — 2?, g(x) = V9 — 2?, 2 = -3, 2 =3. eo 
f(z) = V2, g(x) =27,c7=0,0= 

f(a) = 2e*, g(a) =2,r=0,2=3 

f(a) = 52° + 3, g(x) = 22°, x = -2,4 =2 


In Exercises 39 - 44, you are given a dynamic system where a group of particles of 


varying masses are moving. The path of the i-th object, which has mass m,;, has 


position p;(t) at time t. Calculate the location of the center of mass as a function 


of t. 


39. $p_1(t) = (0,0)$, $p_2(t) = (t,t)$, $m_1 = 8$, $m_2 = 4$.

40. $p_1(t) = (t,0)$, $p_2(t) = (-t,0)$, $m_1 = 5$, $m_2 = 10$.

41. $p_1(t) = (t,0)$, $p_2(t) =$ (40), $p_3(t) = (0,t)$, $m_1 = m_2 = 6$, $m_3 = 5$.

42. $p_1(t) = (\cos 2t, \sin 2t)$, $p_2(t) = (2\cos t, 2\sin t)$, $m_1 = 100$, $m_2 = 300$. This example models
two objects orbiting a central mass-less point. The first “planet” completes a revolution
in half the time of the outer planet.

43. $p_1(t) = (\cos 2t, \sin 2t)$, $p_2(t) = (2\cos t, 2\sin t)$, $p_3 = (0,0)$, $m_1 = 100$, $m_2 = 300$, $m_3 = 1000$.
This system is similar to the previous one, but now includes a massive “sun”.

44. $p_1(t) = (-2 + \cos t, \sin t)$, $p_2(t) = (2 + \cos t, \sin t)$, $m_1 = 30$, $m_2 = 20$.

45. Suppose three particles are initially at rest, and have initial positions $p_1(0) = (-2,2)$,
$p_2(0) = (1,-3)$, $p_3(0) = (4,1)$. At $t = 0$, a constant external force is applied to each
particle. The first particle experiences a force of 6 N in the negative $x$ direction, the
second a force of 14 N in the positive $x$ direction and the third, a force of 16 N in the
positive $y$ direction. The masses of the three particles are 4 kg, 4 kg and 8 kg respectively.


a. Calculate the position function, p;(t), for each of the three particles. 


b. Calculate the position function of the center of mass at time t. Call this function 
E(t). 

c. Calculate the magnitude of the acceleration of the center of mass using your answer 
from part (b). 

d. What is the total external force acting on the system? Hint: resolve the force into x 
and y components first. 

e. Calculate the magnitude of the acceleration of the center of mass by dividing the total 
external force acting on the system by the total mass of the system. Your answer 


should agree with the answer to part (c). 


In Exercises 46 - 50 you are given a system similar to that in the previous problem.
Assume each particle starts at rest and that a constant force is applied to
each particle starting at $t = 0$. For each problem, calculate the magnitude of the
acceleration of the center of mass in two ways: first, by explicitly calculating the
position function of the center of mass, and second, by calculating the total force
acting on the system. The force acting on particle $i$ is given in $x$ and $y$ components.
For example, $\vec F = (3,0)$ indicates a 3 N force in the positive $x$ direction.

46. $p_1(0) = (1,0)$, $p_2(0) = (0,1)$, $\vec F_1 = (3,0)$, $\vec F_2 = (0,3)$, $m_1 = 5$, $m_2 = 5$.

47. $p_1(0) = (0,0)$, $p_2(0) = (2,3)$, $p_3(0) = (-1,-3)$, $\vec F_1 = (2,0)$, $\vec F_2 = (0,-4)$, $\vec F_3 = (-3,0)$,
$m_1 = 3$, $m_2 = 6$, $m_3 = 12$.

48. $p_1(0) = (1,0)$, $p_2(0) = (0,1)$, $p_3(0) = (0,0)$, $\vec F_1 = (2,0)$, $\vec F_2 = (0,2)$, $\vec F_3 = (-2,0)$, $m_1 = 4$,
$m_2 = 8$, $m_3 = 12$.

49. $p_1(0) = (1,0)$, $p_2(0) = (0,1)$, $p_3(0) = (1,0)$, $p_4(0) = (0,1)$, $\vec F_1 = (3,0)$, $\vec F_2 = (0,3)$,
$\vec F_3 = (-1,0)$, $\vec F_4 = (0,-1)$, $m_1 = m_2 = 1$, $m_3 = m_4 = 5$.

50. $p_1(0) = (1,0,0)$, $p_2(0) = (0,1,0)$, $p_3(0) = (0,0,1)$, $p_4(0) = (0,0,0)$, $\vec F_1 = (2,0,0)$, $\vec F_2 = (0,3,0)$,
$\vec F_3 = (0,0,4)$, $\vec F_4 = (0,-3,0)$, $m_1 = m_2 = 6$, $m_3 = m_4 = 12$.
3.9 Work and Energy


The technical notion of work does not coincide precisely with what is frequently referred to as 


work in day-to-day speech. Work is defined to be force applied over a displacement. We shall 
clarify this below, but, for now, we want to emphasize that, if an object does not move, then 
no work is done on the object. This conflicts with what most people would refer to as the work 
required to hold some heavy weight motionless in their hands. We have a tendency to think of 
work as corresponding to applying force over a period of time, not over a displacement; when 


you're in a physics or mathematics class, you need to stop thinking this way. 


Work is, in a technical sense, equivalent to energy, though we usually use the terms slightly
differently when speaking and writing. We typically say things like “it takes energy to produce
work”, but it’s also true that the work done on an object can get “converted” into kinetic energy,
potential energy, heat, etc. All of this falls under the heading of the principle of Conservation of
Energy, which states that energy cannot be created or destroyed, merely converted from one form
to another. Since the work of Einstein showed that mass can be converted into energy, according
to the famous equation $E = mc^2$, which was first put into practice with the development of the
atomic bomb, the principles of Conservation of Energy and Conservation of Mass are frequently
now combined, and referred to as Conservation of Mass-Energy.


Assume that we have an object whose motion is constrained to a straight line, which we take
to be the $x$-axis. We suppose that we have a constant force $F$ which acts parallel to the $x$-axis,
with the sign of $F$ indicating the direction of the force. If the object begins at $x$-coordinate $x_0$
and ends up at $x$-coordinate $x_1$, then the displacement of the object is $\Delta x = x_1 - x_0$.

Definition 3.9.1. The work done by the constant force $F$ on the object, in displacing the
object $\Delta x$, is $F\,\Delta x$.


In the metric system, the standard unit of work/energy is 1 joule, which is equal to 1 
Newton-meter. In the English system, you simply use the foot-pound. 


Remark 3.9.2. Note that work can be positive or negative. If the force is in the direction of the
displacement, i.e., if the force and displacement are both positive, or the force and displacement
are both negative, then the work is positive. However, if the force and displacement are in
opposite directions, then the work is negative. For example, suppose that we exert a force
upward, raising an object against the force of gravity. Then, we do (or, our force does) positive
work on the object. However, gravity does negative work on the object during that displacement.


Instead of saying that a particular force does work, it is common to say that whatever is 
producing the force does the work. For instance, if Sally pushes an object along the ground, 
then we would usually say that Sally does work on the object, rather than refer to the force that 
Sally exerts. 


So, what do we do when the force is not constant? We do what we always do: chop things
up into small pieces, over which we can assume that the force is approximately constant. Then,
we take Riemann sums, and limits, to arrive at the definite integral. Infinitesimally, we write
simply that the infinitesimal work $dW$ done by $F$ in displacing an object by the infinitesimal
amount $dx$ is $dW = F\,dx$, and so:

Proposition 3.9.3. If a continuous force $F = F(x)$ (parallel to the displacement) acts on
an object, as the object is displaced from $x_0$ to $x_1$, then the work $W$ done by $F$ on the object
is given by

$$W = \int_{x_0}^{x_1} F\,dx.$$


Remark 3.9.4. In some of our later problems, in which we look at lifting liquid out of a tank,
it will be more natural to consider the amount of displacement $q$ as a function of the force,
i.e., we will use that, essentially, $q$ is a function of $F$. This means that, instead of using that
$dW = F\,dx$, we will use that $dW = q\,dF$.

Example 3.9.5. Suppose, when an object is located at position $x$ meters, that the object
experiences a force of $F = -k/x^2$ Newtons, where $k$ is a positive constant. Find, in terms of $k$,
how much work the force does on the object, as the object moves from $x = 2$ to $x = 1$ meter.


Solution: We simply calculate

$$W = \int_2^1 -\frac{k}{x^2}\,dx = \frac{k}{x}\Big|_2^1 = k\left(1 - \frac{1}{2}\right) = \frac{k}{2} \text{ joules}.$$
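A quick numerical check of this work integral is sketched below. It is illustrative only: the function name `work` and the sample value `k = 3.0` are hypothetical, and the integral in Proposition 3.9.3 is approximated by a midpoint Riemann sum. Note that the step size is negative when the object moves in the negative $x$ direction, which is what makes the sign of the work come out correctly.

```python
def work(force, x0, x1, n=20_000):
    """Approximate the work done by force(x) as an object moves from x0 to x1.

    Uses a midpoint Riemann sum; the step h is negative when x1 < x0,
    so the direction of the displacement is taken into account.
    """
    h = (x1 - x0) / n
    return h * sum(force(x0 + (i + 0.5) * h) for i in range(n))


k = 3.0                                   # hypothetical sample value of the constant k
W = work(lambda x: -k / x**2, 2.0, 1.0)   # object moves from x = 2 to x = 1
print(W, k / 2)
```

The force and the displacement are both negative here, so the work is positive, in agreement with the value $k/2$ found above.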


Before we look at more examples, it will be useful to first discuss the work/energy equivalence 
and, in particular, discuss the relationship between work, kinetic energy, and (gravitational) 


potential energy. 


Definition 3.9.6. The kinetic energy of an object of mass $m$, moving with speed $v$, is $\frac{1}{2}mv^2$.

Note that it is classical to use $v$ in the formula for kinetic energy, even though the $v$ is speed,
not velocity. For motion in a straight line, where the direction of the velocity is given by a plus
or minus sign, the distinction between speed and velocity in the calculation of the kinetic energy
is irrelevant, for $v$ is squared. However, for the more general situation of arbitrary motion in
space, it is important that you use speed in calculating the kinetic energy.


The relationship between work and kinetic energy is given by: 


Theorem 3.9.7. Suppose that $F$ is the sum of all forces acting (along the $x$-axis) on an
object of constant mass $m$, at all times between $t_0$ and $t_1$, where $t_0 < t_1$. Suppose that
the position $x = x(t)$ of the object is continuously differentiable, and that the velocity $v$ is a
continuously differentiable function of $x$.

Let $x_0$ and $x_1$ denote the positions of the object, at times $t_0$ and $t_1$, respectively, and let
$v_0$ and $v_1$ denote the velocities of the object, at positions $x_0$ and $x_1$, respectively.

Then, $F$ is a continuous function of $x$, and the work done by $F$ in displacing the object from
$x_0$ to $x_1$ is given by

$$\int_{x_0}^{x_1} F\,dx = \frac{1}{2}mv_1^2 - \frac{1}{2}mv_0^2.$$


Proof. Let $v = dx/dt$ denote the velocity. By Newton’s 2nd Law of Motion,

$$F = ma = m\,\frac{dv}{dt} = m\,\frac{dv}{dx}\,\frac{dx}{dt} = mv\,\frac{dv}{dx},$$

where the third equality follows from the Chain Rule.
Thus,

$$W = \int_{x_0}^{x_1} F\,dx = \int_{x_0}^{x_1} mv\,\frac{dv}{dx}\,dx = \int_{v_0}^{v_1} mv\,dv = \frac{1}{2}mv_1^2 - \frac{1}{2}mv_0^2.$$


Example 3.9.8. Suppose that $x(t) = 4 - t^2$ is the position function, in meters, of a 10 kilogram
object, which is moving along the $x$-axis between times $t = 0$ and $t = 3$ seconds. Calculate the
work done in displacing the object from $x(0) = 4$ meters to $x(3) = -5$ meters.

Solution:
By Theorem 3.9.7, all that matters is the change in kinetic energy, so we need the velocities
at times $t = 0$ and $t = 3$. We find

$$v(t) = x'(t) = -2t \text{ m/s}.$$

Thus the work done is

$$\frac{1}{2}(10)\big(v^2(3) - v^2(0)\big) = 5\big((-6)^2 - 0^2\big) = 180 \text{ joules}.$$
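The answer can be cross-checked by computing the work directly as an integral of force against displacement, rather than through the change in kinetic energy. The sketch below is illustrative only and takes the data of the example as $x(t) = 4 - t^2$ and $m = 10$; it uses $dx = v\,dt$ to write the work as $\int F v\,dt$, and the helper name `midpoint_integral` is hypothetical.

```python
def midpoint_integral(f, a, b, n=10_000):
    """Approximate the integral of f over [a, b] with n midpoint rectangles."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


m = 10.0                       # mass in kilograms
velocity = lambda t: -2.0 * t  # v(t) = x'(t) for x(t) = 4 - t^2
acceleration = lambda t: -2.0  # a(t) = v'(t)

# Way 1: W = integral of F dx = integral of (m a) v dt, from t = 0 to t = 3.
work_by_integration = midpoint_integral(
    lambda t: m * acceleration(t) * velocity(t), 0.0, 3.0
)

# Way 2: change in kinetic energy, as in Theorem 3.9.7.
work_by_energy = 0.5 * m * velocity(3.0) ** 2 - 0.5 * m * velocity(0.0) ** 2

print(work_by_integration, work_by_energy)
```

Both computations give 180 joules (the first up to floating-point rounding): the net force is a constant $-20$ N and the displacement is $-9$ m.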


Remark 3.9.9. A special application of Theorem comes up fairly often. 


Suppose that some force F.x4, which we’re thinking of as external, acts on an object with 
constant mass, and that the only other force is Fyou, exerted by you (or anything/anyone else). 
Then, the total force F' acting on the object is F = Fext + Fyou. Suppose that the initial speed 
of the object is equal to the final speed of the object; frequently, we consider the case where the 


object starts at rest, i.e., with zero velocity, and ends up at rest. 
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Then, Theorem tells us that 


v1 ty 
| Fdz =} (Fext + Fyou) dt = 0, 
ro xo 


ie., 


v1 ry 
J Fyou dt = / — Foyt da. (3.3) 


(0) i) 


In words, if an object starts and ends with the same kinetic energy, then the work that you 
do on the object is the negative of the work done by the external force; this is frequently referred 


to as the work done against F.x4. 


Note that Formula 3.3 is correct even though, usually, it would not be the case, at every
time or position, that $F_{\text{you}}$ is equal to $-F_{\text{ext}}$. For instance, if you begin with an object at rest,
and apply a force to lift it, acting against the force of gravity, $-mg$, and end with the object
higher, at rest, then the force that you exert cannot always be equal to $mg$; for you must apply
a force greater than $mg$ to make the mass accelerate, i.e., to make it start moving upward, and
you must apply a force less than $mg$ later in order for the mass to decelerate back down to zero
velocity.
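
To make the lifting scenario concrete, here is a small Python simulation, offered only as a sketch. We assume a hypothetical 5 kg mass that accelerates upward at 1 m/s$^2$ for one second and then decelerates at 1 m/s$^2$ for one second, so that it starts and ends at rest. The mass, the acceleration profile, and all variable names are our own choices. The lifting force is never equal to $mg$, yet the work you do still comes out equal to $mg$ times the height gained.

```python
# Sketch: lift a mass from rest to rest with a non-constant force.
# The mass, the acceleration profile, and all names are hypothetical.
m, g = 5.0, 9.8  # kilograms, m/s^2


def accel(t):
    """Speed up for the first second, slow down for the second."""
    return 1.0 if t < 1.0 else -1.0


n = 200_000
dt = 2.0 / n
v = h = work_you = 0.0
for i in range(n):
    t = (i + 0.5) * dt
    a = accel(t)
    force_you = m * (g + a)    # from F_you - m g = m a
    v_mid = v + 0.5 * a * dt   # velocity at the middle of the time step
    work_you += force_you * v_mid * dt
    h += v_mid * dt
    v += a * dt

print(v, h, work_you, m * g * h)
```

The force you exert is 54 N during the first second and 44 N during the second, never the weight of 49 N, but the accumulated work matches $mg$ times the final height.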


It is crucial in Theorem 3.9.7 and in Formula 3.3 that the mass of the object is constant.
In our derivations, we used Newton’s 2nd Law of Motion that the net force, $F$, acting on
an object is equal to the mass times the acceleration, $ma$. This may not be correct if the
mass is changing. See Remark 3.9.17. Even if $F = ma$, we used that $m$ was constant when
we found that $\int mv\,dv = mv^2/2 + C$.


Definition 3.9.10. The (gravitational) potential energy of an object of mass m, at height 
h above the surface of the Earth, is 


\[
E_p = mgh,
\]


where g is the acceleration due to gravity on Earth. 


We have assumed in the above formula that the heights involved are small enough that the
force of gravity on the object does not vary significantly. We have also taken zero potential
energy to correspond to zero height; however, in fact, it is really only the change in potential
energy that occurs in calculations, and so, if it’s more convenient to take the zero potential
energy location elsewhere, you can do so.


Remark 3.9.9 applies to the case of (gravitational) potential energy.


Theorem 3.9.11. Suppose that a force $F_{\text{you}}$ is used to raise an object of mass $m$ from rest
at height $h = h_0$ (above the surface of the Earth) to rest at height $h = h_1$, and that the only
other force acting on the object is the force of gravity. Make the same assumptions on the
continuity and differentiability of the velocity and position functions as in Theorem 3.9.7
(using, here, $h$ in place of $x$).


Then, the work done by the force $F_{\text{you}}$ is

\[
\int_{h_0}^{h_1} F_{\text{you}}\,dh = mgh_1 - mgh_0,
\]

i.e., the work done by the force $F_{\text{you}}$ is equal to the change in potential energy.


Proof. There are two forces acting on the object: $F_{\text{you}}$ and the force of gravity, $-mg$. By
Remark 3.9.9, we find

\[
\int_{h_0}^{h_1} F_{\text{you}}\,dh = \int_{h_0}^{h_1} -(-mg)\,dh = mg(h_1 - h_0).
\]


Remark 3.9.12. In what sense is potential energy actually energy? The potential energy of
an object is actually the potential energy of the object relative to an ambient (surrounding or
external) force. What we mean is that, in the presence of the gravitational field of the Earth, if
you release an object of mass $m$ from rest, at a height $h$ above the ground, the object will fall,
increasing its speed. In fact, ignoring air resistance, Theorem 3.9.7 tells us that the work done
by the force of gravity will convert the object’s potential energy into kinetic energy; you can
calculate the speed $v$ at which the object will strike the ground simply by solving for $v$ in the
equation $mgh = mv^2/2$. The fact that the object will spontaneously start moving, and acquire
kinetic energy, is why we want to say the object had an initial energy to begin with.
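
Solving $mgh = mv^2/2$ for $v$ gives $v = \sqrt{2gh}$; note that the mass cancels. The short Python sketch below evaluates this for a hypothetical drop height of 20 meters, taking $g = 9.8$ m/s$^2$. Both numbers, and the function name, are our own illustrative choices.

```python
import math

G = 9.8  # acceleration due to gravity, m/s^2 (assumed value)


def impact_speed(height):
    """Speed (m/s) at the ground for an object released from rest at the given
    height in meters, ignoring air resistance: solve m g h = m v^2 / 2 for v."""
    return math.sqrt(2.0 * G * height)


speed = impact_speed(20.0)  # hypothetical 20 meter drop
print(speed)
```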


However, it is important to realize that an object, in and of itself, without the presence of
some external force, does not possess “potential energy”. If the Earth suddenly vanished, an
object with positive gravitational potential energy would immediately lose this potential energy.


There is potential energy associated with force fields other than gravitational force. All that 
is really required to have a reasonable notion of potential energy is that you have a conservative 
force field. While this is a topic for multivariable Calculus, physically, a conservative force field 
is one in which the work done by the force field in moving an object from one point to another 
is independent of the path along which the object moves; all that matters is the beginning and 
end points of the path. 


Now we’re ready to handle a classic example, after we make the following comment/warning:


In the following example, we use Newton’s 2nd Law of Motion and conclude that the net
force acting on an object is equal to the mass of the object times its acceleration, i.e., we
conclude that $F = ma$, even though the mass of the object is changing. This is okay
in this example since we assume that the mass which is leaving the object (a leaky bucket
of sand) has the same velocity as the bucket as the sand leaks out. See Remark 3.9.17 for
a detailed discussion of the issue.


Example 3.9.13. Suppose that you are lifting up a bucket of sand 40 feet, at a constant velocity 
of 2 ft/s. Suppose that the bucket initially contains 50 lb of sand, but is leaking sand at a rate 
of 1 pound of sand per second. How much work do you do in lifting the bucket of sand? 


Figure 3.62: Lifting a leaking bucket of sand. 


Assume that the rope/wire used to lift the bucket has negligible weight, that by “the velocity
of the bucket” we mean the velocity of the center of mass of the bucket and the sand that’s in
it, and that the instantaneous velocity of the sand as it leaves the bucket is equal to the velocity
of the bucket, i.e., assume that the escaping sand leaves the bucket with velocity 2 ft/s (and
then starts decelerating once gravity is the only force acting on it).


Note that gravity acts downward, while the displacement is upward. Thus, the work done by
gravity will be negative, while the work that you do will be positive. We take the acceleration
of gravity to be $g = 32$ ft/s$^2$.


Set up a y-axis vertically, with 0 at the initial position of the bucket, and the positive 
direction being upward. Let t be the amount of time elapsed, in seconds, from the time at which 
you start raising the bucket. 


The force of gravity $F_{\text{grav}}$ on the bucket is its weight, with a minus sign, to indicate that
gravity acts downward. As the bucket initially weighs 50 lb, and loses 1 pound per second, we
have

\[
F_{\text{grav}} = -(50 - t) \ \text{lb}.
\]

But we need $F_{\text{grav}}$ as a function of $y$, so that we can calculate the work done by gravity
$W_{\text{grav}} = \int_0^{40} F_{\text{grav}}\,dy$.


Since the bucket is moving with a constant velocity of 2 ft/s, the position of the bucket at
time t seconds is y = 2t feet. Hence, t = y/2, and so

\[
F_{\text{grav}} = -\left(50 - \frac{y}{2}\right) \ \text{lb}.
\]


As we mentioned before starting this problem, since the sand is leaving the bucket at the
same velocity as the bucket, the total/net force $F = F_{\text{you}} + F_{\text{grav}}$ acting on the bucket is $ma$,
where $m$ is the mass of the bucket of sand and $a$ is its acceleration. But, the bucket of sand is
moving with constant velocity; so $a = 0$, and we conclude that

\[
F = F_{\text{you}} + F_{\text{grav}} = 0,
\]

i.e., that $F_{\text{you}} = -F_{\text{grav}} = (50 - y/2)$ lb.


Now, we calculate easily

\[
W_{\text{you}} = \int_0^{40} F_{\text{you}}\,dy = \int_0^{40} \left(50 - \frac{y}{2}\right) dy = \left.\left(50y - \frac{y^2}{4}\right)\right|_0^{40} = (2000 - 400) = 1600 \ \text{ft-lb}.
\]
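
The same calculation can be sketched in Python. The helper names `trapezoid` and `bucket_work` below are hypothetical, and we use the trapezoid rule only because it is exact for the linear force in this problem; the sketch also assumes the bucket does not run out of sand before the lift ends.

```python
def trapezoid(f, a, b, n=1000):
    """Composite trapezoid rule for the integral of f over [a, b]."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for i in range(1, n):
        total += f(a + i * h)
    return total * h


def bucket_work(weight0, leak_rate, speed, height):
    """Work (ft-lb) done lifting a leaking bucket at constant speed.

    weight0 is the initial weight in lb, leak_rate is in lb/s, speed is in
    ft/s, and height is in ft. At height y the elapsed time is y / speed,
    so the lifting force is F_you = weight0 - leak_rate * y / speed.
    """
    def force_you(y):
        return weight0 - leak_rate * y / speed

    return trapezoid(force_you, 0.0, height)


work = bucket_work(50.0, 1.0, 2.0, 40.0)
print(work)
```

With the data of this example, the result should agree with the 1600 ft-lb found above.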


A standard example of a force which changes when the position changes is provided by a 
spring, which has a block attached to it. In a sense, we don’t really need to have a block 
attached to the spring for a discussion about the work involved in compressing or stretching the 
spring. However, it is convenient to assume that the spring itself has negligible mass, and the 
only relevant mass in the problem is that of the block. 


Thus, let’s suppose that the left end of a horizontal spring is attached to a wall, and a block 
is attached to the right end of the spring. Suppose that the center of the block is at x = 0 
meters, when the spring is at its natural length. This is called the equilibrium position for the 
block. 


When the block is to the left of the equilibrium position, so that the spring is compressed,
the force exerted by the spring pushes the block to the right. When the block is to the right of
the equilibrium position, so that the spring is stretched, the spring force pulls the block to the
left. See Figures 3.63 and 3.64.


Figure 3.63: A compressed spring.

Figure 3.64: A stretched spring.


Springs, if they are not compressed or stretched too far, are typically assumed to obey Hooke’s
Law: the force that the spring exerts is proportional to the displacement from the equilibrium
position, and acts in the direction opposite the displacement.


Let $F = F(x)$ denote the force exerted by the spring on the block, when the center of the
block is at position $x$, as measured from its equilibrium position. Note that the displacement
of the center of the block from its equilibrium position and the displacement of the right end of
the spring from its equilibrium position are the same; thus, we may use the block’s displacement
and equilibrium position in applying Hooke’s Law. Hence, what Hooke’s Law tells us is that
there exists a positive constant $k$ such that

\[
F = -kx.
\]


The constant k depends on the spring in question, and is called the spring constant. 


Example 3.9.14. Suppose that a spring has spring constant k = 50 N/m. How much work 
does it take to compress the spring 1 meter from the equilibrium position? 


Solution: 


While it is not stated explicitly in the problem, it is implicit that the spring/mass starts
at rest and ends at rest. In addition, unlike the leaky bucket problem in Example 3.9.13, the
mass is not changing. Hence, we may apply Formula 3.3 to conclude that the work done by the
compressing force is negative the work done by the spring, i.e.,

\[
\int_0^{-1} F_{\text{comp}}\,dx = \int_0^{-1} -F_{\text{spring}}\,dx = \int_0^{-1} -(-kx)\,dx = \left.\frac{kx^2}{2}\right|_0^{-1} = 25 \ \text{joules}.
\]
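
Here is a brief Python sketch of the same computation, with function names of our own choosing. It evaluates the antiderivative $kx^2/2$ directly and, as a check, also forms a midpoint Riemann sum in which $dx$ is negative because the block moves from $x = 0$ to $x = -1$. It also illustrates that stretching the spring by 1 meter takes the same work as compressing it by 1 meter.

```python
def spring_work(k, x0, x1):
    """Work done against a Hooke's Law spring as the block moves from x0 to x1.

    The integrand is -(-k x) = k x, whose antiderivative is k x^2 / 2.
    """
    return 0.5 * k * (x1 ** 2 - x0 ** 2)


def spring_work_numeric(k, x0, x1, n=100_000):
    """The same work from a midpoint Riemann sum; dx is negative when x1 < x0."""
    dx = (x1 - x0) / n
    return sum(k * (x0 + (i + 0.5) * dx) * dx for i in range(n))


compress = spring_work(50.0, 0.0, -1.0)
stretch = spring_work(50.0, 0.0, 1.0)
check = spring_work_numeric(50.0, 0.0, -1.0)
print(compress, stretch, check)
```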


Now we come to two examples where, instead of using that an infinitesimal amount of work 
dW is F dx, we use that dW = qdF, where dF is an infinitesimal amount of force and q is the 
displacement of the mass on which dF acts. 


Example 3.9.15. Suppose that we have a tank, which is a right circular cylinder of radius 8
feet and height 30 feet. The tank contains water up to a height of 10 feet (and the rest of the
tank is empty). Assume that the weight-density of water (weight per volume, a.k.a., the specific
weight) is $\delta_w = 62.4$ lb/ft$^3$.


We want to find the amount of work required to pump all of the water to the top of the tank.
We are not discussing any extra work involved in then moving the water off to a pipe at some
location at the top of the tank; in particular, we mean the work involved in moving the water
to the top of the tank at rest. As the water is not initially moving, and we are calculating the
work involved in leaving the water at the top of the tank at rest, the work involved in pumping
the water is negative the work that gravity does on the water as we move the water to the top,
i.e., the work done against gravity.


Okay. So how do we find the work done against gravity? Set up a z-axis, with z = 0 at
the bottom of the tank, and with the positive direction being up. For each z-coordinate where
we have water, that is, for z between 0 and 10, we calculate the work involved in lifting the
infinitesimally thickened z cross section of water. We then add up, i.e., take the integral of, all
of these infinitesimal pieces of work as z goes from 0 to 10.


Consider the infinitesimally thickened z cross section of water. As in Section the infinitesimal
volume of this piece is $dV = A(z)\,dz$, where $A(z)$ is the cross-sectional area. In our
current setting, $A(z)$ is constant; it’s the area inside a circle of radius 8 feet. Thus, $A(z) = 64\pi$
ft$^2$, and $dV = 64\pi\,dz$. The (negation of the) infinitesimal force that gravity exerts on this chunk
of volume is

\[
dF = \delta_w\,dV = (62.4)(64\pi)\,dz \ \text{lb}.
\]


How much do we have to displace the slab of volume at coordinate z to get it to the top of the
tank? As the tank is 30 feet high, the needed displacement is $30 - z$ feet. Thus, the infinitesimal
work done against gravity in order to lift the infinitesimally thick piece of volume at a given
z-coordinate between 0 and 10 is

\[
dW = (30 - z)\,dF = (30 - z)(62.4)(64\pi)\,dz \ \text{ft-lb}.
\]

Therefore, the total work required to move all of the water to the top of the tank is

\[
W = \int_{z=0}^{z=10} dW = \int_0^{10} (30 - z)(62.4)(64\pi)\,dz = (62.4)(64\pi)\left.\left(30z - \frac{z^2}{2}\right)\right|_0^{10} = (62.4)(64\pi)(300 - 50) \approx 3{,}136{,}566.1 \ \text{ft-lb}.
\]


Note that the limits of integration refer to the z-coordinates at which there is water which
needs to be lifted. It is a common mistake to let z go from 0 to the height of the tank. Keep
in mind what the integral is doing: it’s adding up all of the infinitesimal pieces of work from
each of the z cross sections at which there’s water that’s being lifted.


We could have obtained this same answer in a different way. In Exercise 32, we ask you to
show something that you may have guessed: that you can calculate the work in this problem
(or any problem like it) by taking the total weight of the water and then multiplying by the
“right” displacement. What is this right displacement? The distance between the z-coordinate of
the center of mass of the water and the top of the tank.


In more-complicated problems, calculating the work by finding the total weight and
z-coordinate of the center of mass would not be significantly easier than calculating a
corresponding work integral. However, in our current problem, the calculation via the center of mass
is trivial. By symmetry, the z-coordinate of the center of mass is z = 5 feet. The total weight
of the water is $\delta_w = 62.4$ lb/ft$^3$ times the volume of a right circular cylinder of radius 8 feet and
height 10 feet. Therefore, we quickly find that the work required to move all of the water to the top
of the tank is

\[
(62.4)(64\pi)(10)(30 - 5) \ \text{ft-lb},
\]


which, of course, agrees with what we calculated above. 
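
The agreement between the two methods can also be checked numerically. The following Python sketch adds up the work on thin horizontal slabs of water and compares the total with the weight-times-displacement shortcut. The function name `pumping_work` and the midpoint rule are our own illustrative choices; the function accepts any cross-sectional area function `area(z)`.

```python
import math

DELTA_W = 62.4  # weight-density of water, lb/ft^3


def pumping_work(area, water_top, tank_top, n=100_000):
    """Approximate work (ft-lb) to pump water filling 0 <= z <= water_top
    up to height tank_top, where area(z) is the cross-sectional area (ft^2).

    Each slab of thickness dz weighs DELTA_W * area(z) * dz and must be
    lifted a distance of tank_top - z.
    """
    dz = water_top / n
    total = 0.0
    for i in range(n):
        z = (i + 0.5) * dz
        total += (tank_top - z) * DELTA_W * area(z) * dz
    return total


def cylinder_area(z):
    """Cross-sectional area of a cylinder of radius 8 ft (independent of z)."""
    return math.pi * 8.0 ** 2


work_slabs = pumping_work(cylinder_area, 10.0, 30.0)

# Shortcut: total weight times the distance from the center of mass to the top.
weight = DELTA_W * math.pi * 8.0 ** 2 * 10.0
work_center_of_mass = weight * (30.0 - 5.0)

print(work_slabs, work_center_of_mass)
```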


In the next example, we change the tank from the previous problem; instead of using a right
circular cylinder, we use an upside-down cone.


Example 3.9.16. Suppose that we have a tank, which is a right circular cone, which is
(mysteriously) balancing on its point. Suppose that the top radius of the cone is 8 feet and height is
30 feet. The tank contains water up to a height of 10 feet (and the rest of the tank is empty).
Assume that the weight-density of water (weight per volume) is $\delta_w = 62.4$ lb/ft$^3$. We once again
want to find the amount of work required to pump all of the water to the top of the tank.


What changes from our previous example? In a sense, very little. The only thing that 
changes is our formula for the cross-sectional area of the tank at a given z value between 0 and 
10 feet. Our z cross sections are still disks, but the radius of the disks gets smaller as we get 
closer to the bottom of the tank. 


By using similar triangles, we quickly conclude that the radius $r$ of the cross section at $z$
satisfies

\[
\frac{z}{30} = \frac{r}{8}, \quad \text{and so} \quad r = \frac{4z}{15}.
\]

Therefore, the cross-sectional area is given by

\[
A(z) = \pi\left(\frac{4z}{15}\right)^2 = \frac{16\pi z^2}{225}.
\]


Hence, instead of calculating

\[
W = \int_{z=0}^{z=10} dW = \int_0^{10} (30 - z)(62.4)(64\pi)\,dz,
\]

which is what we did in Example 3.9.15, we need to calculate

\[
W = \int_{z=0}^{z=10} dW = \int_0^{10} (30 - z)(62.4)\left(\frac{16\pi z^2}{225}\right) dz.
\]


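We find

\[
W = \frac{(62.4)(16\pi)}{225}\int_0^{10} (30z^2 - z^3)\,dz = \frac{(62.4)(16\pi)}{225}\left.\left(10z^3 - \frac{z^4}{4}\right)\right|_0^{10} = \frac{(62.4)(16\pi)}{225}(10{,}000 - 2500) \approx 104{,}552.2 \ \text{ft-lb}.
\]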
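
For the cone, the integrand $(30 - z)\,\delta_w\,A(z)$ is a cubic polynomial in $z$, and Simpson's rule happens to be exact for cubics, so even a coarse subdivision reproduces the value above. The Python sketch below illustrates this; the helper names are hypothetical, and Simpson's rule is our choice of method, not one used in the text.

```python
import math

DELTA_W = 62.4  # weight-density of water, lb/ft^3


def simpson(f, a, b, n=10):
    """Composite Simpson's rule with an even number n of subintervals."""
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3


def cone_area(z):
    """Cross-sectional area (ft^2) at height z, using r = 4z/15."""
    r = 4.0 * z / 15.0
    return math.pi * r ** 2


def slab_work(z):
    """Work per unit thickness to lift the slab at height z to the top (z = 30)."""
    return (30.0 - z) * DELTA_W * cone_area(z)


work = simpson(slab_work, 0.0, 10.0)
print(work)
```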


We end this section with a serious discussion of Newton’s 2nd Law of Motion for objects
with changing mass. We should explicitly mention that this is part of Newtonian Mechanics,
and does not include a discussion that takes Einstein’s theories of relativity into account; this
means that we are dealing with velocities which are small compared to the velocity of light, so
that the relativistic effects are insignificant.


Remark 3.9.17. When an object changes mass, it either gains or loses mass, and the
appropriate form of Newton’s 2nd Law depends on the relative velocity of the center of mass of the
object and the mass which is gained or lost. Understand that part of the issue with changing
mass is that what we are calling “the object” changes; we suddenly start calling the old object,
together with the new particles that joined it, the “new object”, or we stop referring to particles
that used to be part of the object as part of the “new object”.


Let $m$ denote the mass of the object at time $t$, and let $F$ denote the net force acting on the
object. Then, $dm/dt$ will be positive if the object is gaining mass and negative if the object is
losing mass. Let $v_{rel}$ denote the velocity, relative to the velocity of the center of mass of the
object, of the gained or lost mass instantaneously as it joins or departs the object; that is, $v_{rel}$
is the velocity of the center of mass of the gained or lost mass according to someone who is moving
along with the center of mass of the object, and who considers their own velocity to be zero.
This means that if an outside observer says that the velocity of the center of mass of the object
is $v$ and the velocity of the center of mass of the gained or lost mass is $v_{gl}$, then

\[
v_{rel} = v_{gl} - v.
\]
Then, the correct form of Newton’s 2nd Law is

\[
F = ma + (v - v_{gl})\,\frac{dm}{dt}. \tag{3.4}
\]

Therefore, if the center of mass of the mass that is gained or lost is moving at the same velocity
as the center of mass of the object, such as in the leaky bucket problem in Example 3.9.13, then
$v - v_{gl} = 0$ and Formula 3.4 collapses to the familiar

\[
F = ma.
\]

If, instead, the gained or lost mass has velocity $v_{gl} = 0$, then Formula 3.4 becomes

\[
F = m\,\frac{dv}{dt} + v\,\frac{dm}{dt} = \frac{d(mv)}{dt},
\]

i.e., the net force equals the rate of change of the momentum, with respect to time. This
situation is a reasonable approximation of what happens as hail gains mass as it falls through
the atmosphere and smashes into essentially motionless water molecules, which become part of
the hail. This case may also apply to a rocket, if the rocket is expelling burned fuel at velocity
0, i.e., at velocity $-v$, relative to the rocket.


We should remark that you need to be careful with Conservation of Energy calculations when
objects collide or separate. When objects collide, momentum is conserved, but kinetic energy
need not be conserved; some of the kinetic energy present before the collision may be “used up”
in deforming the objects which are colliding. These deformations involve the objects heating
up, so that some of the kinetic energy is converted into heat. Such a collision, in which kinetic
energy is not conserved, is called an inelastic collision. Thus, in an inelastic collision, the total
kinetic energy drops. A collision in which the objects actually merge to form one object, such
as when motionless water molecules attach to hail, is referred to as totally inelastic.


When objects separate, there is the analogous concept of a totally inelastic separation. In a
totally inelastic separation, the kinetic energy goes up. This implies that extra energy had to
come from somewhere, such as a chemical reaction which releases heat. A typical example of
this is when a rocket expels burned/burning fuel.


3.9.1 Exercises


In Exercises 1 through 5, calculate the work done in moving a particle subject to
the force F(x) as it moves between the endpoints of the interval. Assume that x is
the position of the particle along the x-axis, measured in meters, and F is measured in
Newtons.


1. $F(x) = 3x^2 - 2x + 1$, $-3 \le x \le 6$.


2. F(x) =4sin(2xz)+Inz, 1 <a<a. eo 
3 
3. F(a)= 4, 55289. 


2 
1 
4, F(a) =" o<a<4 
av 


BPG) =fe-" LA eS. 


6. You exert a force $F_{\text{you}}$ on an object. The only other force acting on the object is an external
force, $F_{\text{ext}}$. The object is moving along the x-axis. If the initial and final positions of the
object are the same, is it true that the total work experienced by the object is zero?
Explain your answer.


7. You exert a force $F_{\text{you}}$ on an object. The only other force acting on the object is an
external force, $F_{\text{ext}}$. The object is moving along the x-axis. If the initial and final speeds
of the object are the same, is it true that the total work experienced by the object is zero?
Explain your answer.


In Exercises 8 through 12, you are given the position function $x(t)$ of an object with
constant mass $m$ kilograms moving along the x-axis between times $t_0$ and $t_1$.
Calculate the work done in displacing the object from $x(t_0)$ to $x(t_1)$. The displacement
is measured in meters and time is measured in seconds.


8. $x(t) = \cos t$, $t_0 = 0$, $t_1 = \pi/2$.


9. $x(t) = \cos t$, $t_0 = 0$, $t_1 = \pi$.


10. $x(t) = -4.9t^2 + 2t + 100$, $t_0 = 0$, $t_1 = 3$.


11. $x(t) = 3\ln(t + 1)$, $t_0 = 0$, $t_1 = 9$.


12. $x(t) = \cosh t$, $t_0 = 0$, $t_1 = 4$.


In Exercises 13 through 17, an object experiences a force depending on its position
$x$, and a constant $k$. The total work, $W$, done by the force on the object as it moves
along the x-axis from position $x_0$ to $x_1$ is also given. Solve for $k$. $W$ is measured in
joules, $F$ is measured in Newtons, and displacement is measured in meters.


13. $F = -k/x^2$, $x_0 = 2$, $x_1 = 1$, $W = 112$.


14. $F = x + k$, $x_0 = 3$, $x_1 = 12$, $W = 96$.


15. $F = -k/x$, $x_0 = 4$, $x_1 = 8$, $W = 80$.


16. $F = ke^x$, $x_0 = 0$, $x_1 = 3$, $W = 72$.


17. F=kax?, x9 = 4, 21 = 5, W =3. 


In Exercises 18 through 22, you are given the mass and velocity, $v(x)$, of an object
moving along the x-axis as a function of its position. Calculate the total work done
as the object moves from $x_0$ to $x_1$. Mass is measured in kilograms, velocity in meters
per second, and displacement in meters.


18. v(x) = 327 +2, m= 12, 2p = 2, 21 = 5. 


19. v(x) = 9, m = 15, wp = 2, x1 = 29. 


20. u(x) = sintaz, m = 20, zo = 0, x1 = 1. 


21. v(x) = |9x — 2|, m=9, ao = 1, 4%) =3. oe 


22. u(x) = en m=11, % =5, 47, =5. 


23. A worker is cleaning windows on a tall building. The combined mass of the worker and
the lift used to transport him is 150 kg. If the distance between each floor is 3 meters,
how much work is required to elevate the worker and the lift from the 12th to the 22nd
floor?


24. A ball is dropped from a height of 50 meters. What is the speed of the ball when it hits
the ground? Solve by equating formulas for potential and kinetic energy.


25. Redo Example 3.9.13 under the following assumptions: the bucket of sand is lifted at
constant velocity 4 ft/s. The bucket initially contains 80 lb of sand and the sand is leaking
at a rate of 0.5 lb per second. How much work is done lifting the sand 60 feet?


26. Consider a bucket-lifting problem where all the initial data is the same as in
Example 3.9.13, with the following exceptions: The bucket is to be lifted to a height of 120 feet
instead of 40 feet. The bucket leaks sand until only 1 lb of residual sand remains. How
much work is done in lifting the bucket to 120 feet?


27. Generalize Example 3.9.13. Suppose that you are lifting a bucket of sand $h$ feet at a
constant velocity $v$ ft/s. The bucket initially contains $P_0$ lb of sand, but is leaking sand
at a rate of $r$ pounds of sand per second. Assume $P_0 > rh/v$. How much work is done in
lifting the bucket?


In Exercises 28 through 30, calculate the work required to compress the spring with
the given constant by d meters from the equilibrium position. Use Hooke’s Law.


28. k = 10 N/m, d = 2 meters.


29. k = 12 N/m, d = 3 meters.


30. k = 6 N/m, d = 5 meters.


31. Generalize Theorem 3.9.11 by removing the assumptions that the initial and final velocities
of the object are zero, and calculating $\int_{h_0}^{h_1} F_{\text{you}}\,dh$ in terms of changes in the potential
energy and the kinetic energy of the object.


32. Suppose that a tank has height $H$ and has z cross-sectional area given by the continuous
function $A(z)$, where z = 0 corresponds to the bottom of the tank, and up is the positive z
direction. Suppose also that the tank contains a liquid, which fills the portion of the tank
between z = 0 and z = b, where $0 < b \le H$, and that, at each fixed z-coordinate between
0 and b, every point in the z cross section has the same weight-density $\delta_w(z)$. Finally,
suppose that $\delta_w(z)$ is continuous.


Show that the work required to pump all of the liquid to the top of the tank is equal
to the total weight of the liquid times the distance between the top of the tank and the
z-coordinate of the center of mass of the liquid.


We now consider a particle which is subjected to a force as the particle travels along
a path in three dimensions. We use the vector material in Appendix A.


Let $\vec{\alpha}(t) = (x(t), y(t), z(t))$ be a parameterization of the path, where $a \le t \le b$.
Suppose that $\vec{F} : \mathbb{R}^3 \to \mathbb{R}^3$ is a vector force field. Then, we define the work done by
the force on the particle along the path using integration and the dot product:

\[
W = \int_a^b \vec{F}(\vec{\alpha}(t)) \cdot \vec{\alpha}'(t)\,dt.
\]

Here, we are assuming that $\vec{F}$ is continuous and $\vec{\alpha}$ is differentiable. Note that even
though the particle is traveling in three dimensions, the calculation of work involves
a one-dimensional integral.
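
To see how such a one-dimensional integral can be evaluated in practice, here is a Python sketch that approximates the integral, over the parameter interval, of the dot product of the force (evaluated along the path) with the velocity of the path. The example is hypothetical and is not one of the exercises: a constant gravitational force $(0, 0, -mg)$ acts on a 2 kg particle climbing one turn of a helix. All names, and the midpoint rule, are our own choices.

```python
import math


def work_along_path(force, path, velocity, a, b, n=100_000):
    """Midpoint-rule approximation of the integral, for a <= t <= b, of
    force(path(t)) dotted with velocity(t), where velocity is the derivative
    of path. Each of the three functions returns a tuple of components."""
    dt = (b - a) / n
    total = 0.0
    for i in range(n):
        t = a + (i + 0.5) * dt
        f = force(*path(t))
        total += sum(fc * vc for fc, vc in zip(f, velocity(t))) * dt
    return total


m, g = 2.0, 9.8  # hypothetical mass (kg) and gravitational acceleration (m/s^2)


def gravity(x, y, z):
    return (0.0, 0.0, -m * g)


def helix(t):
    return (math.cos(t), math.sin(t), t)


def helix_velocity(t):
    return (-math.sin(t), math.cos(t), 1.0)


work = work_along_path(gravity, helix, helix_velocity, 0.0, 2.0 * math.pi)
print(work, -m * g * 2.0 * math.pi)
```

The result matches $-mg$ times the change in height, $2\pi$, even though the path winds around; this is consistent with the earlier remark that, for a conservative force field, only the endpoints of the path matter.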


33. 


Suppose F(a, y, z) = (2x2, ryz,5 — z), is a vector force field acting on a particle whose 
position is given by @(t) = (t?,t,5), 1<t <2. 


a. Show that $\vec{F}(\vec{\alpha}(t)) = (2t^2, 5t^3, 0)$.
b. Show that $\vec{F}(\vec{\alpha}(t)) \cdot \vec{\alpha}'(t) = 9t^3$.
c. Calculate the work done by $\vec{F}$ on the particle on the path $\vec{\alpha}(t)$ by calculating $\int_1^2 9t^3\,dt$.


In Exercises 34 through 38, calculate the work done on a particle by the force field 


F as the particle moves along the parameterized path. F is in Newtons, distances 


are in meters, and time is in seconds. 


34. 


35. 


36. 


37. 


38. 


> 


F(a, y,z) = (y, 2, 2), &(t) = (cost, sint, t), 0 < t < 2rn. 

F(x, y, 2) = (yz, xz, xy), A(t) = (2t, t, -3t), -l<t<1. 

F(a, y,z) = (3a? — 22 +1,0,0), &(t) = (t,0,0), -3 <t <6. Note that this is just another 
way of presenting Exercise [I] of this section. 


=> 


F(«,y,z) = (sinha, y,z), a(t) = (0, 5sint, 5cost),0<t < 2r. 


a _ TY 
F(z,y) = x2 + y?? 2? 4+ y? ’ 


path and the force field are both two dimensional. We can fit this exercise into our 


a(t) = (cost, sint), 0 < t < 27. In this case, the particle’s 


definitions above by declaring the third components of both the particle’s path and the 
force field to be constantly zero. 


In Exercises 39 through 42, it will be helpful to recall Example 3.9.15 and
Example 3.9.16. In particular, you should use that the weight-density of water is $\delta_w = 62.4$
lb/ft$^3$.


39. A rectangular tank has length 20 feet, width 8 feet, and height 30 feet. The tank contains
water up to a height of 10 feet (and the rest of the tank is empty). Find the amount of
work required to pump all of the water to the top of the tank.


40. Suppose that we have a tank, which is a right circular cylinder of radius 8 feet and height
30 feet, and the tank is initially full. Find the amount of work required to pump all of the
water to the top of the tank.


41. Suppose that we have a tank, which is a right circular cone, which is balancing on its point.
Suppose that the tank is full, and that the top radius of the cone is 8 feet and height is 30
feet. Find the amount of work required to pump all of the water to the top of the tank.


42. Suppose that we have a tank, which is a right circular cone, which has its circular base at
the bottom, and its vertex at the top. Suppose that the radius of the base is 8 feet and
height is 30 feet. The tank contains water up to a height of 10 feet (and the rest of the
tank is empty). Find the amount of work required to pump all of the water to the top of
the tank.


3.10 Hydrostatic Pressure 


The term “hydrostatic pressure” refers to pressure that acts on a surface which is submerged
in motionless water (“static” for motionless, “hydro” for water), and while we shall, in fact,
discuss mainly the case of water on Earth, our discussion applies to essentially any body of
liquid anywhere.


When an object is partially, or totally, submerged in a liquid, such as water, the surfaces of
the object which are exposed to the liquid experience pressure, force per area, which depends
solely on the depth of the liquid at each point on the exposed surface. Assuming that the body
of liquid is open above the submerged surface, this pressure can be thought of as a result of the
weight of the liquid above each point.


Imagine a plate of area $A$ submerged in a liquid which has a weight-density (weight per
volume) of $\delta_w$, and assume that the plate lies at a constant depth $D$ beneath the (open) surface
of the liquid. We assume that the air pressure on the surface of the liquid is negligible. Then
the weight, i.e., the force $F$ due to gravity, of the liquid above the plate is the weight-density
$\delta_w$ of the liquid times the volume, $AD$, of liquid above the plate. Thus, $F = \delta_w AD$, and so the
pressure, the force per area, is

\[
P = F/A = \delta_w D. \tag{3.5}
\]


While a complete discussion of the physics and mathematics is beyond the scope of this
book, amazingly, the pressure $P$ on a submerged surface does not depend on the orientation of
the surface inside the liquid; the pressure $P$ at any point at depth $D$ on the submerged surface
produces a pressure vector, which has magnitude $P = \delta_w D$ and points perpendicularly (a.k.a.,
normally) into the surface. In fact, the liquid does not even have to be open to the air (or
vacuum) directly above the submerged surface; all that matters is the weight-density of the
liquid, and the depth of each point on the submerged surface below the line determined by the
open surface of the liquid, even if that open surface of the liquid does not lie directly
above the submerged surface.


When a solid object is submerged (partially or totally) in a liquid, the fact that lower parts
of the surface of the object experience more pressure than higher parts explains the existence
of buoyancy force, the force with which the liquid pushes up on the object. Buoyancy force is
easy to describe using Archimedes’ Principle: the buoyancy force exerted on a submerged object
is equal to the weight of the amount of the liquid displaced, i.e., if the volume of the submerged
portion of the object is $V$, then the buoyancy force pushing upward on the object is equal to
$\delta_w V$. This assumes that the entire surface of the object which lies below the level of the surface
of the liquid is exposed to the liquid (for instance, the bottom of the object should not be flush
with the base of a water tank).


The serious mathematical treatment of many of the general results pertaining to hydrostatic
pressure, in particular, Archimedes’ Principle, requires the use of multivariable Calculus and
integrating vector fields. However, we shall restrict ourselves to a type of problem which we can
handle with single variable Calculus; we will look at the total force on a flat surface which is
vertical inside our liquid, i.e., a submerged surface that is part of a plane which is perpendicular
to the surface of the liquid.


Example 3.10.1. Suppose that one end of a swimming pool is parabolic (and vertical). Set
up a vertical y-axis, with y = 0 at the bottom of the pool, and where the end of the pool is the
region above $y = x^2$ and below $y = 9$, where all distances are measured in feet. Assume the
pool is full of water, and that the weight-density of water is $\delta_w = 62.4$ lb/ft$^3$.


Figure 3.65: The parabolic end of a swimming pool. 


We want to determine how much force the water exerts on this parabolic end. 


How do you approach such a problem? Well, for $0 \le y \le 9$, all of the points at that given
y-coordinate have depth $D = 9 - y$, and so experience hydrostatic pressure of $P = \delta_w D = \delta_w(9 - y)$.
We need to multiply this pressure by the infinitesimal area $dA$ at that y-coordinate, to obtain
the infinitesimal force $dF$.


Figure 3.66: A strip of infinitesimal area at a given y-coordinate.


The infinitesimal area at coordinate y is the infinitesimal height dy times the length LD of 
the infinitesimally high rectangle inside our region at the given y-coordinate. That length is the 
distance between the left x-coordinate on the graph of y = x7, at the given y value, and the 
right x-coordinate on the graph. Solving for x in terms of y, we find that x = +,/y, and so the 
two x-coordinates in question are —,/y and \/y. Hence, L = 2,/y, and 


dF = PdA = 6,(9—y)Ldy = 62.4(9 — y)2,/y dy lb. 


Therefore, the total hydrostatic force $F$ on the parabolic end of the full pool is the continuous
sum of all of the infinitesimal contributions:

$$F = \int_{y=0}^{y=9} dF = \int_0^9 62.4(9 - y)\,2\sqrt{y}\,dy = 124.8\int_0^9 \left(9y^{1/2} - y^{3/2}\right)dy =$$

$$124.8\left(9\cdot\frac{y^{3/2}}{3/2} - \frac{y^{5/2}}{5/2}\right)\Bigg|_0^9 = 124.8\left(6\cdot 27 - \frac{2}{5}\cdot 243\right) = 8087.04 \ \text{lb}.$$


Let’s record the result of our discussion above in a proposition. 
Proposition 3.10.2. Suppose you have a vertical surface, where $y$ is the vertical coordinate,
and that the portion of (one side of) the surface between $y = a$ and $y = b$ ($a < b$) is submerged
in water, where the top level of the water is at $y = T$. If $L(y)$ is the cross-sectional length
of the surface at $y$, and $\delta_w$ is the weight-density of water, then the total force, pushing into the
given portion of the surface, due to hydrostatic pressure, is given by

$$\int_a^b \delta_w (T - y)\,L(y)\,dy.$$
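The integral in Proposition 3.10.2 can also be checked numerically. The following is only a rough sketch, assuming a simple midpoint rule; the names `hydrostatic_force` and `parabola_length`, and the number of strips, are our own illustrative choices and are not part of the text. Applied to the full parabolic pool of Example 3.10.1, the approximation should agree closely with the exact value 8087.04 lb.

```python
import math

WATER_WEIGHT_DENSITY = 62.4  # lb/ft^3, the value used in the examples


def hydrostatic_force(length, a, b, top, density=WATER_WEIGHT_DENSITY, n=200_000):
    """Midpoint-rule approximation of the integral of
    density * (top - y) * length(y) for y from a to b."""
    h = (b - a) / n
    total = 0.0
    for i in range(n):
        y = a + (i + 0.5) * h  # midpoint of the i-th horizontal strip
        total += density * (top - y) * length(y) * h
    return total


def parabola_length(y):
    # Cross-sectional length of the region above y = x^2: L(y) = 2*sqrt(y)
    return 2.0 * math.sqrt(y)


# Example 3.10.1: water from y = 0 up to the waterline at y = 9
full_pool = hydrostatic_force(parabola_length, a=0.0, b=9.0, top=9.0)
print(full_pool)
```

Changing `b` and `top` to the waterline height handles a partially filled pool with the same function.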


Example 3.10.3. Consider the same swimming pool as in the previous example, but now 
assume that the water in the pool is only 4 feet deep, i.e., there is water against the parabolic 


wall between y = 0 and y = 4 feet. Now how much force does the water exert against the wall? 


Figure 3.67: Now the pool is not full. 


Solution: What changes and what stays the same from the previous example? In our current
situation, for $0 \le y \le 4$, all of the points at a given $y$ value have depth $D = 4 - y$, because
the depth (as far as hydrostatic pressure is concerned) is measured from the waterline, not from
the top of the pool. Thus, the hydrostatic pressure at each point at a given $y$-coordinate is
$P = \delta_w D = \delta_w(4 - y)$.

The infinitesimal area at coordinate $y$ is still the infinitesimal height $dy$ times the length $L$
of the infinitesimally high rectangle inside our region at the given $y$-coordinate. That length is
still $L = 2\sqrt{y}$, and so

$$dF = P\,dA = \delta_w(4 - y)L\,dy = 62.4(4 - y)\,2\sqrt{y}\,dy \ \text{lb}.$$


The total hydrostatic force $F$ on the parabolic end of the pool is the continuous sum
of all of the infinitesimal contributions, but now there are contributions only for $0 \le y \le 4$.
Therefore,

$$F = \int_{y=0}^{y=4} dF = \int_0^4 62.4(4 - y)\,2\sqrt{y}\,dy = 124.8\int_0^4 \left(4y^{1/2} - y^{3/2}\right)dy =$$

$$124.8\left(4\cdot\frac{y^{3/2}}{3/2} - \frac{y^{5/2}}{5/2}\right)\Bigg|_0^4 = 124.8\left(\frac{8}{3}\cdot 8 - \frac{2}{5}\cdot 32\right) = 1064.96 \ \text{lb},$$

which is far less force than we found in the previous example in which the pool was full.
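As a supplementary check (not part of the original examples), the same computation can be carried out for an arbitrary waterline height $T$ with $0 \le T \le 9$; the symbol $F(T)$ is introduced here only for this purpose. The resulting closed form reproduces both of the values found above.

```latex
\begin{align*}
F(T) &= \int_0^T 62.4\,(T - y)\,2\sqrt{y}\,dy
      = 124.8\left(T\cdot\frac{2}{3}T^{3/2} - \frac{2}{5}T^{5/2}\right)
      = 124.8\cdot\frac{4}{15}\,T^{5/2} = 33.28\,T^{5/2} \ \text{lb},\\
F(9) &= 33.28\cdot 243 = 8087.04 \ \text{lb}, \qquad
F(4) = 33.28\cdot 32 = 1064.96 \ \text{lb}.
\end{align*}
```

In particular, $F(4)/F(9) = (4/9)^{5/2} = 32/243 \approx 0.13$, so lowering the water from 9 feet to 4 feet leaves only about 13% of the original force.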


3.10.1 Exercises
Throughout these exercises assume that the weight-density of water is 62.4 lb/ft$^3$ or 9800 N/m$^3$.


1. Suppose that one side of a swimming pool is a flat, vertical rectangular wall of length 12 
feet and height 10 feet, and the pool is full, i.e., the water is 10 feet deep. Find the total 


hydrostatic force on the given wall. 


2. Consider the previous exercise. 


a. How much force is exerted on the lower half (half, by height) of the wall? 
b. How much force is exerted on the upper half of the wall? 


c. Explain why one of these values should obviously be larger than the other. 


3. Suppose that one side of a swimming pool is a wall which is a flat, vertical isosceles
trapezoid, with height 10 meters, and that the upper parallel side has length 12 meters,
and the lower parallel side has length 8 meters. Suppose that the pool is full, i.e., the
water is 10 meters deep. Find the total hydrostatic force on the given wall.


4. Consider the previous exercise.


a. How much force is exerted on the lower half (half, by height) of the wall?
b. How much force is exerted on the upper half of the wall?


c. Explain why, this time, unlike in Exercise 2, it is not so obvious that one of these
values should be larger than the other.


5. Suppose that one side of a tank is a flat, vertical semi-circular wall of radius 6 feet, with 
the “big end” up, i.e., the diameter of the semi-circle is at the top of the tank. Suppose 
that the water is 4 feet deep (that is, the top of the water is 4 feet above the bottom of 


the semicircle). 


a. Set this up with $y = 0$ at the top of the wall. Show that the total hydrostatic force
is given by

$$\int_{-6}^{-2} 62.4(-2 - y)\,2\sqrt{36 - y^2}\,dy \ \text{lb}.$$


b. Evaluate the integral from part a.


6. Suppose now that the semi-circular wall, of radius 6 feet, from the previous problem is 
upside-down, i.e., the diameter of the semi-circle is at the bottom of the tank. If the water 
in the tank is still 4 feet deep, determine the hydrostatic force on the wall. 


7. Suppose that one side of a tank is a wall which is a flat, vertical isosceles triangle, with the
equal angles at the top, and a vertex at the bottom. Let $H$ be the height of the triangle,
and let $W$ be the width at the top of the triangle. Suppose that the tank is full, i.e., the
water is $H$ meters deep. Find the total hydrostatic force on the given wall.


8. Consider the previous exercise. 


a. How much force is exerted on the lower half (half, by height) of the wall?
b. How much force is exerted on the upper half of the wall?


c. What do you observe about your answers in parts (a) and (b)?


9. Let’s generalize what you should have observed in the previous exercise. Suppose that one 
side of a tank is a wall which is flat and vertical. Let y = 0 correspond to the bottom of 
the wall, and y = b correspond to the top. Suppose that the cross-sectional length of the 
wall is given by L(y) = ky, for some constant k > 0. Suppose the tank is full, so that the 


top of the water is also at y = b. 


Show that the hydrostatic force on the lower half (half, by height) of the wall is equal to 
the hydrostatic force on the upper half. 
10. Consider a vertical end of a swimming pool, as in Example 3.10.1 and Example 3.10.3.
Let $A$ be the area of the region $R$ of the vertical end that’s covered with water. Set up a
vertical $y$-axis. Then, show that the total hydrostatic force acting on the swimming pool
end is equal to $\delta_w AD$, where $\delta_w$ is the weight-density of water and $D$ is the depth (below
the waterline) of the $y$-coordinate of the centroid of $R$ (see Section 3.8).


Chapter 4 


Understanding Functions via 
Polynomials and Power Series 


Fundamentally, the question that we address in this chapter is: what does it mean to “know 


what a function is”? 


Do you really know what a function $f(x)$ is if you can’t calculate the value of $f$, to some
desired accuracy, for each $x$ in the domain of $f$? Or, another way to look at this is: does your
calculator know what functions are better than you do? It can calculate values, to the accuracy
of the display, of functions like $e^x$, $\ln x$, $\sin x$, and $\tan^{-1} x$. Can we do the same thing?


However, knowing what a function is goes beyond what your calculator can easily handle. 
We'd also like to know when one interesting combination of functions is equal to another inter- 
esting combination of functions, as functions, not just for specific values of the independent 
variable; in other words, we’d like to produce identities between functions, and/or recognize 


when we have them. 


As we shall see, you can often approximate functions to any desired accuracy by using poly- 
nomials, and many functions that we use often are equal to power series, basically polynomials 


with an infinite number of terms. 


Because polynomial functions are so easy to evaluate and manipulate, approximating via
polynomials, and/or representing a function as some sort of never-ending polynomial, is
extremely valuable. As you may recall, in Worldwide Differential Calculus, [2], we actually defined
the exponential function $\exp(x) = e^x$ by using a power series, which made it very easy to derive
the important properties of $\exp(x)$, and allowed us to approximate the value of $e = \exp(1)$ to
any desired accuracy.


Most of the technical details of this chapter are theorems that will appear in the next
chapter. Most textbooks place the study of these theorems before the results on polynomial
approximation and power series. Sadly, this seems to lead to many students missing the most
important and useful aspects of the theory.


4.1 Approximating Polynomials


In this section, we will discuss approximating polynomials by using just some of their terms/summands.
This will also require us to rewrite polynomials, where, instead of using powers of $x$, we use
powers of $(x - a)$, where $a$ is some fixed value, e.g., powers of $(x - 2)$. As we shall see, this
rewriting is important for approximating well for values of $x$ near $a$.


Throughout this section, m and n will both denote integers which are greater than, or equal 


to, zero. 


Of course, you know that a polynomial function is a function $p(x)$ defined by a polynomial,
e.g.,

$$p(x) = 4 - 2x + 5x^2 + 9x^5.$$


Note that the term polynomial actually refers to the algebraic expression, while polynomial 
function means the function defined by the algebraic expression; this distinction will never 
cause us a problem, for we will always be very explicit about the algebraic ways in which we 
wish to rewrite polynomials, even though the original and rewritten polynomials will, of course, 


both define the same function. 


Using our sigma/summation notation (from Definition 2.1.1), we can write a general
polynomial, in powers of $x$, as

$$p(x) = c_0 + c_1x + c_2x^2 + c_3x^3 + \cdots + c_dx^d = \sum_{k=0}^{d} c_kx^k,$$

where $d$ is the degree of the polynomial, provided that $c_d \neq 0$. Note that, in the above formula,
we have assumed that $x^0 = 1$, even if $x = 0$. This is very notationally convenient.


We shall adopt the convention that the expression $x^0$ is equal to 1, even when $x = 0$.


We do this despite the fact that $0^0$ is, in general, undefined. This convention simply saves
us from having to write a lot of extra words about what we mean in various formulas, in the
special case where $x = 0$.


Much of this section is based on the simple observation: 
Proposition 4.1.1. When $x$ is close to 0, higher powers of $x$ are closer to 0. More precisely,
if $1 \le m < n$, and $0 < |x| < 1$, then $|x^n| < |x^m|$, and $\lim_{n\to\infty} |x^n| = 0$.


So? What does this do for us? It gives us the (imprecise) general principle: 


When $x$ is close to 0, a polynomial function $p(x)$ can be approximated “well” by leaving off
terms of the form $x^n$, where $n$ is “large”.


We shall make the above general principle more precise in Section 4.3.


Let’s look at an example. 


Example 4.1.2. Consider the polynomial function

$$p(x) = 4 - 2x + 5x^2 + 9x^5.$$

You may be used to thinking of the highest-power term $9x^5$ as the most important term of
the polynomial and, if $x$ is large in absolute value, then that’s certainly correct. But, when $x$
is close to 0, $x^5$ will be very small, even after multiplying by 9. The point is: when $x$ is close
to 0, the 0-th degree term, 4, is the most important, $-2x$ is the second most important term,
and $5x^2$ is third most important; the $9x^5$ term is the least important if $x$ is close enough to 0.


How close does x need to be to 0 to make the lower-degree terms larger in absolute value 
than the higher-degree terms? That depends on the precise coefficients in p(x), and we will 
address such questions in a more general context in Section For now, we’re going to plug 
in some numbers and look at the approximations that we obtain when we omit the terms of 


“large” degree. 


However, before we start plugging in numbers, let’s adopt the notation that $p^n(x)$ means the
sum of all of the terms of $p(x)$, from the constant term up to the $x^n$ term (even if that term is
missing, i.e., has a zero coefficient).


Hence,

$$p^0(x) = 4, \quad p^1(x) = 4 - 2x, \quad p^2(x) = p^3(x) = p^4(x) = 4 - 2x + 5x^2, \quad\text{and}\quad p^5(x) = p(x) = 4 - 2x + 5x^2 + 9x^5.$$

We won’t need anything beyond $p^5$, but it is still worth noting that $p^n(x) = 4 - 2x + 5x^2 + 9x^5$, for all
$n \ge 5$.


When $x$ is close to 0, we want to approximate $p(x)$ by leaving off its terms of “large degree”,
i.e., we want to look at the approximation $p(x) \approx p^n(x)$, for various $n$, when $x$ is close to 0.


Let’s fix $x = 0.1$, and look at the approximations $p(x) \approx p^0(x)$, $p(x) \approx p^1(x)$, and
$p(x) \approx p^2(x)$. First, we find

$$p(0.1) = 4 - 2(0.1) + 5(0.1)^2 + 9(0.1)^5 = 4 - 0.2 + 0.05 + 0.00009 = 3.85009,$$

and

$$p^0(0.1) = 4, \quad p^1(0.1) = 4 - 2(0.1) = 3.8, \quad\text{and}\quad p^2(0.1) = 4 - 2(0.1) + 5(0.1)^2 = 3.85.$$

As you can see, the partial polynomials, or partial sums, above give “reasonable” approximations
to the actual value of $p(0.1)$, that are accurate to within (plus or minus) 0.14991, 0.05009, and
0.00009, respectively.


What happens if we pick an $x$ which is even closer to 0? Let’s look at $x = 0.01$. Then, we
find

$$p(0.01) = 4 - 2(0.01) + 5(0.01)^2 + 9(0.01)^5 = 4 - 0.02 + 0.0005 + 0.0000000009 = 3.9805000009,$$

and

$$p^0(0.01) = 4, \quad p^1(0.01) = 4 - 2(0.01) = 3.98, \quad\text{and}\quad p^2(0.01) = 4 - 2(0.01) + 5(0.01)^2 = 3.9805.$$

We see that the partial sums now give even better approximations to the actual value of $p(0.01)$;
the approximations by the partial sums now are accurate to within (plus or minus) 0.0194999991,
0.0005000009, and 0.0000000009, respectively.
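The arithmetic in this example is easy to reproduce by machine. Here is a small sketch; the helper names `partial_sum` and `approximation_errors` are hypothetical, chosen only for this illustration. It evaluates the partial sums of $p(x) = 4 - 2x + 5x^2 + 9x^5$ and reports the errors of the 0-th, 1st, and 2nd order partial sums at a given $x$.

```python
def partial_sum(coeffs, n, x, center=0.0):
    """n-th order partial sum of sum(coeffs[k] * (x - center)**k)."""
    return sum(c * (x - center) ** k for k, c in enumerate(coeffs[: n + 1]))


# p(x) = 4 - 2x + 5x^2 + 9x^5, coefficients listed by power of x
P_COEFFS = [4, -2, 5, 0, 0, 9]


def approximation_errors(x, orders=(0, 1, 2)):
    """Absolute errors |p(x) - p^n(x)| for each order n in `orders`."""
    exact = partial_sum(P_COEFFS, len(P_COEFFS) - 1, x)
    return [abs(exact - partial_sum(P_COEFFS, n, x)) for n in orders]


for x in (0.1, 0.01):
    print(x, approximation_errors(x))
```

Up to floating-point rounding, the printed errors should match the values listed above for $x = 0.1$ and $x = 0.01$.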
Let’s look at another sort of approximation problem.

Example 4.1.3. Suppose again that $p(x) = 4 - 2x + 5x^2 + 9x^5$. If $x$ is close to 0, then what
power of $x$ best approximates $(p(x) - 4 + 2x)/5$?

Solution:


We find

$$(p(x) - 4 + 2x)/5 = (4 - 2x + 5x^2 + 9x^5 - 4 + 2x)/5 = x^2 + 9x^5/5.$$

Thus, the question is: when $x$ is close to 0, what power of $x$ best approximates $x^2 + 9x^5/5$?


The answer is: the 2nd power, since the smallest degree term, the $x^2$ term, is most relevant
when $x$ is close to 0.


Okay. Great. Now we know that, when $x$ is close to 0, we can approximate polynomials,
that are written in terms of powers of $x$, by taking the first “few” lowest degree terms, i.e., by
using the partial sums. We expect the approximation to be better when $x$ is closer to 0, or when
we take a partial sum with more terms.


But what if we want to approximate a polynomial when $x$ is close to some $x$ value other
than 0? For instance, can we easily approximate the value of a polynomial when $x$ is close to
$-2$? Or when $x$ is close to an arbitrary $x$ value $a$?


The answer is: yes, and we can do it in exactly the same way that we approximated when
$x$ was close to 0, provided that the polynomial is written in terms of powers NOT of
$x$, but of $(x - a)$.


For example, let’s look at $q(x) = 7 + 3(x + 2) - 5(x + 2)^4$. Just as with powers of $x$, when the
quantity $(x + 2)$ is close to 0, higher powers of $(x + 2)$ are closer to 0, and are less significant.


Hence, when $(x + 2)$ is close to 0, we might use either approximation

$$q(x) \approx 7, \quad\text{or}\quad q(x) \approx 7 + 3(x + 2).$$

Note that saying that $(x + 2)$ is close to 0 is the same as saying that $x$ is close to $-2$ and, more
generally, saying that $x$ is close to $a$ is the same as saying that $(x - a)$ is close to 0. This is why,
if we want to approximate a polynomial well for values of $x$ near some fixed value $a$, we want
our original polynomial to be written in powers of $(x - a)$.
We want to adopt terminology for polynomials written in terms of powers of $(x - a)$, and
also for their partial sums.


Definition 4.1.4. A polynomial centered at $a$ is a polynomial written in terms of powers
of $(x - a)$ (or with some other variable in place of $x$), i.e., a polynomial of the form

$$q(x) = b_0 + b_1(x - a) + b_2(x - a)^2 + \cdots + b_d(x - a)^d = \sum_{k=0}^{d} b_k(x - a)^k,$$

where, if $b_d \neq 0$, $d$ denotes the degree of $q(x)$. Hence, a polynomial written in terms of
powers of $x$ is centered at 0.


The $n$-th order partial sum, $q^n(x)$, of such a $q(x)$ is the sum of the first $n + 1$ terms of $q(x)$,
i.e., all of the terms of $q(x)$ from $b_0$ to $b_n(x - a)^n$. Hence,

$$q^n(x) = b_0 + b_1(x - a) + \cdots + b_n(x - a)^n = \sum_{k=0}^{n} b_k(x - a)^k,$$

where we take $b_k = 0$ for $k > d$. In particular, if $n \ge d$, then $q^n(x) = q(x)$.


Note that the polynomial function defined by a polynomial centered at a is the same function 
that you have if you expand the polynomial and write it in terms of powers of x. The point is 
that the polynomial function does not “care” where it’s centered. This is the main reason that 
we sometimes distinguish, in this book, between a polynomial, as an algebraic expression, and 
the associated polynomial function. However, you needn’t worry about being confused; when 


we write something about a polynomial centered at a, it will be clear what we mean. 


The point (for us) of polynomials centered at $a$ is:


If $q(x)$ is a polynomial centered at $a$, and $x$ is close to $a$, then the polynomial function $q(x)$
can be approximated “well” by using the partial sums $q^n(x)$.


Before we give an example of approximating a polynomial that’s centered somewhere other 
than 0, we will make a couple of definitions, so that we can more easily discuss the error in 


approximating by partial sums. 
Definition 4.1.5. Suppose that $q(x) = \sum_{k=0}^{d} b_k(x - a)^k$ is a polynomial centered at $a$.
Then, the difference between $q(x)$ and its $n$-th order partial sum $q^n(x)$ is called the $n$-th
remainder, and is denoted $R^n(x)$. Thus,

$$R^n(x) = q(x) - q^n(x)$$

or, equivalently, $q(x) = q^n(x) + R^n(x)$.


The error in approximating $q(x)$ by $q^n(x)$ is $E^n(x) = |R^n(x)|$, the absolute value of the
remainder. Thus, $q^n(x)$ approximates $q(x)$ to within plus or minus the error, i.e.,

$$q^n(x) - E^n(x) \le q(x) \le q^n(x) + E^n(x).$$


Example 4.1.6. Let’s return to our example, above, of a polynomial centered at $-2$:

$$q(x) = 7 + 3(x + 2) - 5(x + 2)^4.$$

Suppose that $x$ is within 0.1 of $-2$. What’s the maximum amount of error that we could have
in approximating $q(x)$ by $q^1(x)$?


Solution:


To say that “$x$ is within 0.1 of $-2$” is the same as saying “$x$ is between $-2 - 0.1$ and
$-2 + 0.1$”, i.e., that $-2 - 0.1 \le x \le -2 + 0.1$. Adding 2 everywhere, we see that this is
equivalent to $-0.1 \le x + 2 \le 0.1$, which, in turn, is equivalent to

$$|x + 2| \le 0.1.$$

The error in approximating $q(x)$ by $q^1(x) = 7 + 3(x + 2)$ is

$$E^1(x) = |R^1(x)| = |q(x) - q^1(x)| = |-5(x + 2)^4| = 5|x + 2|^4.$$

Therefore, if $|x + 2| \le 0.1$, then we have the following upper bound on the error in
approximating $q(x)$ by $q^1(x)$:

$$E^1(x) = 5|x + 2|^4 \le 5(0.1)^4 = 0.0005.$$

It is important to note that this is an upper bound on the error in the approximation,
given that all that we know about $x$ is that $|x + 2| \le 0.1$. Of course, if we know that some
particular $x$ is even closer to $-2$, then the error $E^1(x)$ will be smaller than 0.0005.
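The bound in this example can be spot-checked numerically. The sketch below is only an illustration, with our own function names `q` and `q1` and an arbitrarily chosen grid of sample points. It evaluates the error $|q(x) - q^1(x)|$ on evenly spaced points with $|x + 2| \le 0.1$ and compares the worst case against $5(0.1)^4$.

```python
def q(x):
    # q(x) = 7 + 3(x + 2) - 5(x + 2)^4, the polynomial centered at -2
    return 7 + 3 * (x + 2) - 5 * (x + 2) ** 4


def q1(x):
    # 1st order partial sum of q
    return 7 + 3 * (x + 2)


BOUND = 5 * 0.1 ** 4  # the upper bound 0.0005 derived in the example

# 201 evenly spaced sample points covering the interval [-2.1, -1.9]
samples = [-2.1 + 0.001 * i for i in range(201)]
worst = max(abs(q(x) - q1(x)) for x in samples)
print(worst, BOUND)
```

The largest sampled error occurs at the endpoints of the interval, where it equals the bound up to floating-point rounding; everywhere else in the interval the error is strictly smaller.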


At this point, we know that, when $x$ is close to $a$, we can approximate a polynomial $q(x)$,
which is in terms of powers of $(x - a)$, by using the partial sums $q^n(x)$. But what do we do if we
are given a polynomial centered at one point, say, centered at 0, and we want to approximate
the values of the polynomial function when $x$ is near some other point $a \neq 0$?


What we need to be able to do is start with a polynomial function $p(x)$, of degree $d$, and
figure out how to rewrite $p(x)$ as a polynomial $q(x)$ centered at any point $a$ that we desire. That
is, given $p(x)$ and $a$, we need to be able to produce the coefficients $b_k$ so that

$$p(x) = b_0 + b_1(x - a) + b_2(x - a)^2 + b_3(x - a)^3 + \cdots + b_d(x - a)^d = \sum_{k=0}^{d} b_k(x - a)^k.$$


Before we describe what happens generally, let’s look at an example. 


Example 4.1.7. Suppose that, once again, $p(x) = 4 - 2x + 5x^2 + 9x^5$, and we want to
approximate $p(x)$ by some lower-degree polynomial, so that the approximation is good when $x$ is close
to 1.


What we need to do is rewrite $p(x)$ as a polynomial $q(x)$, which is centered at 1, and then
use the partial sums of $q(x)$ for our approximations. Thus, we want to find $b_0$, $b_1$, $b_2$, $b_3$, $b_4$,
and $b_5$ so that

$$b_0 + b_1(x - 1) + b_2(x - 1)^2 + b_3(x - 1)^3 + b_4(x - 1)^4 + b_5(x - 1)^5 = p(x). \tag{4.1}$$


We could (but won’t) approach this in a purely algebraic fashion: you expand all of the terms
on the left, collect the terms with the same powers of $x$, and match the coefficient of each $x^k$ on
the left with the coefficient of $x^k$ in $p(x)$. The coefficients on the left will involve all of the $b_k$,
and you will end up with 6 (linear) equations and 6 unknowns. These equations will actually
be easy to solve: you first find $b_5$, then $b_4$ is easy to determine after that, then $b_3$, and so on.


However, Calculus gives us a much easier, faster way to determine the $b_k$. First, plug in
$x = 1$ on both sides of Formula 4.1 and note that all of the powers $(x - 1)^k$, where $k \ge 1$,
become 0. Therefore, we obtain quickly that

$$b_0 = p(1).$$


Now, differentiate each side of Formula 4.1 to obtain

$$b_1 + 2b_2(x - 1) + 3b_3(x - 1)^2 + 4b_4(x - 1)^3 + 5b_5(x - 1)^4 = p'(x) \tag{4.2}$$

and, once again, plug $x = 1$ into both sides of the equation; all of the powers of $(x - 1)$ are 0,
and we find

$$b_1 = p'(1).$$

You may guess at this point that $b_k$ always equals $p^{(k)}(1)$, the $k$-th derivative of $p$ at 1. This is
not correct.


Differentiate again (differentiate Formula 4.2), to obtain

$$2b_2 + 2\cdot 3\,b_3(x - 1) + 3\cdot 4\,b_4(x - 1)^2 + 4\cdot 5\,b_5(x - 1)^3 = p''(x) \tag{4.3}$$

and plug in $x = 1$ to find

$$2b_2 = p''(1), \quad\text{i.e.,}\quad b_2 = \frac{p''(1)}{2}.$$

Hmmmm...what is the pattern here? Let’s differentiate more times, and always plug in $x = 1$.
You’ll see what happens. We find:


$$2\cdot 3\,b_3 + 2\cdot 3\cdot 4\,b_4(x - 1) + 3\cdot 4\cdot 5\,b_5(x - 1)^2 = p'''(x),$$

$$2\cdot 3\cdot 4\,b_4 + 2\cdot 3\cdot 4\cdot 5\,b_5(x - 1) = p^{(4)}(x),$$

$$2\cdot 3\cdot 4\cdot 5\,b_5 = p^{(5)}(x),$$

and so

$$b_3 = \frac{p^{(3)}(1)}{3!}, \quad b_4 = \frac{p^{(4)}(1)}{4!}, \quad\text{and}\quad b_5 = \frac{p^{(5)}(1)}{5!},$$

where, for $n \ge 1$, $p^{(n)}$ denotes the $n$-th derivative of $p$, and $n! = 1\cdot 2\cdot 3\cdots n$ is $n$ factorial.


For $n = 0$, there are special definitions; $p^{(0)}$ is the 0-th derivative of $p$, and so is just $p$ itself ($p$
differentiated zero times), while $0!$ is defined to be 1.


With these notations, what we have seen is that, for all $k$, we have the general formula

$$b_k = \frac{p^{(k)}(1)}{k!}.$$


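For our particular $p(x) = 4 - 2x + 5x^2 + 9x^5$, we find

$$p'(x) = -2 + 10x + 5\cdot 9x^4, \quad p''(x) = 10 + 4\cdot 5\cdot 9x^3, \quad p^{(3)}(x) = 3\cdot 4\cdot 5\cdot 9x^2,$$

$$p^{(4)}(x) = 2\cdot 3\cdot 4\cdot 5\cdot 9x, \quad\text{and}\quad p^{(5)}(x) = 2\cdot 3\cdot 4\cdot 5\cdot 9.$$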


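Therefore,

$$b_0 = p(1) = 16, \quad b_1 = p'(1) = 53, \quad b_2 = \frac{p''(1)}{2!} = 95, \quad b_3 = \frac{p^{(3)}(1)}{3!} = 90,$$

$$b_4 = \frac{p^{(4)}(1)}{4!} = 45, \quad\text{and}\quad b_5 = \frac{p^{(5)}(1)}{5!} = 9.$$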


Finally, we find

$$p(x) = 4 - 2x + 5x^2 + 9x^5 = 16 + 53(x - 1) + 95(x - 1)^2 + 90(x - 1)^3 + 45(x - 1)^4 + 9(x - 1)^5.$$

So, how do you produce a reasonable approximation of $p(x)$, when $x$ is close to 1? Use the
partial sums of the polynomial centered at 1, e.g.,

$$16 + 53(x - 1) \quad\text{or}\quad 16 + 53(x - 1) + 95(x - 1)^2$$

would provide reasonable lower-degree approximations of $p(x)$, when $x$ is close to 1.
The process that we went through in the above example works in general. We conclude:


Proposition 4.1.8. Suppose that $p(x)$ is a polynomial function of degree $d$, and that $a$ is
a fixed number. Then, there is a unique polynomial, of degree $d$,

$$q(x) = \sum_{k=0}^{d} b_k(x - a)^k,$$

centered at $a$, such that there is an equality of functions $p(x) = q(x)$; that polynomial is the
one which has coefficients given by

$$b_k = \frac{p^{(k)}(a)}{k!}.$$
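The coefficient formula $b_k = p^{(k)}(a)/k!$ translates directly into a short program. What follows is a minimal sketch, assuming the polynomial is supplied as a list of its coefficients in powers of $x$; the function names `derivative`, `evaluate`, and `recenter` are our own and purely illustrative.

```python
from math import factorial


def derivative(coeffs):
    """Coefficients (by power of x) of the derivative of sum(coeffs[k] * x**k)."""
    return [k * c for k, c in enumerate(coeffs)][1:]


def evaluate(coeffs, x):
    """Evaluate sum(coeffs[k] * x**k) by Horner's rule."""
    result = 0
    for c in reversed(coeffs):
        result = result * x + c
    return result


def recenter(coeffs, a):
    """Return [b_0, ..., b_d] with b_k = p^(k)(a) / k!."""
    b = []
    current = list(coeffs)
    for k in range(len(coeffs)):
        b.append(evaluate(current, a) / factorial(k))
        current = derivative(current)  # move on to the next derivative
    return b


# p(x) = 4 - 2x + 5x^2 + 9x^5, re-centered at a = 1
print(recenter([4, -2, 5, 0, 0, 9], 1))
```

For the polynomial of Example 4.1.7, the list returned should contain the coefficients 16, 53, 95, 90, 45, and 9 found there (as floating-point numbers, since the sketch uses ordinary division).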


Example 4.1.9. Rewrite the polynomial $p(x) = (x - 2)^3$ as a polynomial centered at $-1$, and
use the 2nd partial sum of this new polynomial to approximate $p(-0.9)$.


Solution:


In order to use Proposition 4.1.8, we need to calculate the derivatives of $p$ at $-1$. We find

$$p'(x) = 3(x - 2)^2, \quad p''(x) = 6(x - 2), \quad\text{and}\quad p'''(x) = 6,$$

and so

$$p(-1) = -27, \quad p'(-1) = 27, \quad p''(-1) = -18, \quad\text{and}\quad p'''(-1) = 6.$$

Therefore, Proposition 4.1.8 tells us that

$$(x - 2)^3 = -27 + 27(x + 1) + \frac{-18}{2!}(x + 1)^2 + \frac{6}{3!}(x + 1)^3 = -27 + 27(x + 1) - 9(x + 1)^2 + (x + 1)^3.$$

The approximation

$$(x - 2)^3 \approx -27 + 27(x + 1) - 9(x + 1)^2,$$

applied to $x = -0.9$ yields

$$p(-0.9) = (-2.9)^3 \approx -27 + 27(0.1) - 9(0.1)^2 = -24.39.$$

The actual value of $(-2.9)^3$ is $-24.389$; so our approximation is pretty good.


4.1.1 Exercises 


In Exercises 1 through 6, you are given a polynomial $p(x)$, centered at some $a$, and
values $x_0$ near, or equal to, the center. For each $x_0$, determine $p(x_0)$, and the values
of the 1st and 2nd order partial sums of $p(x)$, evaluated at $x_0$. In each case, find the
error in approximating $p(x_0)$ by the 1st and 2nd partial sums.


1. $p(x) = 4 - 3x + 5x^2 - 9x^3 + 100x^4$; $x_0 = 0,\ 0.1,\ 0.01,\ -0.001$.

2. $p(x) = 4 - 3(x + 2) + 5(x + 2)^2 - 9(x + 2)^3 + 100(x + 2)^4$; $x_0 = -2,\ -1.9,\ -1.99,\ -2.001$.

3. $p(x) = 7 + (x - 1) + 12(x - 1)^2 - 15(x - 1)^6$; $x_0 = 1,\ 1.1,\ 1.01,\ 0.999$.

4. $p(x) = 7 + x + 12x^2 - 15x^6$; $x_0 = 0,\ 0.1,\ 0.01,\ -0.001$.

5. $p(x) = 1 + x + x^2 + x^3 + x^4 + x^5$; $x_0 = 0,\ -0.01,\ 0.001$.

6. $p(x) = 1 + (x - \pi) + (x - \pi)^2 + (x - \pi)^3 + (x - \pi)^4 + (x - \pi)^5$; $x_0 = \pi,\ \pi - 0.01,\ \pi + 0.001$.


7. Let $p(x) = 4 - 3x + 5x^2 - 9x^3 + 100x^4$. When $x$ is close to 0, what power of $x$ best
approximates $p(x) - 4 + 3x$?


8. Let $p(x) = 4 - 3(x + 2) + 5(x + 2)^2 - 9(x + 2)^3 + 100(x + 2)^4$. When $x$ is close to $-2$, what
power of $(x + 2)$ best approximates $p(x) - 4 + 3(x + 2)$?
9. Let $p(x) = 7 + (x - 1) + 12(x - 1)^2 - 15(x - 1)^6$. When $x$ is close to 1, what polynomial
of the form $c(x - 1)^n$ best approximates $p(x) - 7 - (x - 1)$?

10. Let $p(x) = 7 + (x - 1) + 12(x - 1)^2 - 15(x - 1)^6$. When $x$ is close to 1, what polynomial
of the form $c(x - 1)^n$ best approximates the derivative $[p(x) - 7 - (x - 1)]'$?


In each of Exercises 11 through 15, rewrite the polynomial $q(x)$ as a polynomial
centered at the given $a$.


11. $q(x) = x^2 + 5x + 6$, $a = -2$.

12. $q(x) = (x + 2)^3 - (x + 2) + 1$, $a = 2$.

13. $q(x) = x^3 + 15x^2 + 75x + 125$, $a = 5$.

14. $q(x) = 1 + x + 2x^2 + 3x^3 + 4x^4$, $a = 1$.

15. $q(x) = 1 - x + 2x^2 - 3x^3 + 4x^4$, $a = 1$.


In each of Exercises 16 through 20, use the (re-centered) 1st order partial sum from
your answers to Exercises 11 through 15 to approximate $q(x)$ at the given value.


16. Approximate $q(x) = x^2 + 5x + 6$ at $x = -2.1$.

17. Approximate $q(x) = (x + 2)^3 - (x + 2) + 1$ at $x = 2.05$.

18. Approximate $q(x) = x^3 + 15x^2 + 75x + 125$ at $x = 4.99$.

19. Approximate $q(x) = 1 + x + 2x^2 + 3x^3 + 4x^4$ at $x = 1.1$.

20. Approximate $q(x) = 1 - x + 2x^2 - 3x^3 + 4x^4$ at $x = 1.1$.


In Exercises 21 through 25, use the method from Example 4.1.6 to calculate an
upper bound on the error in approximating $q(x)$ by $q^{d-1}(x)$, where $d$ is the degree
of $q$, for $x$ values within the given amount of the center.


21. $q(x) = 3 + 4(x + 2) - 6(x + 2)^2$, $x$ is within 0.2 of $-2$.

22. $q(x) = 4x^4 + 3x^3 + 2x^2 + x + 1$, $x$ is within 0.1 of 0.

23. $q(x) = 5(x - 3)^{\circ} - 7(x - 3)$, $x$ is within 0.01 of 3.

24. $q(x) = 6(x - 1)^2 - 5(x - 1) + 7$, $x$ is within 0.05 of 1.

25. $q(x) = 12(x + 4)^4 + (x + 4)^3 - (x + 4)^2 + (x + 4)$, $x$ is within 0.3 of $-4$.


26. a. Let $p(x)$ be a polynomial (centered at 0) which contains only even powers of $x$.
Suppose $b_0, \ldots, b_d$ are the coefficients of $p(x)$ centered at $a$ and that $c_0, \ldots, c_d$ are the
coefficients of $p(x)$ centered at $-a$. Prove that $b_i = c_i$ when $i$ is even, and $b_i = -c_i$
when $i$ is odd.


b. State and prove a similar conclusion when $p(x)$ contains only odd powers of $x$.
4.2 Approximation of Functions by Polynomials


In the previous section, we discussed approximating polynomials p(x), centered at some point a, 
for values of x close to a, by using the first few lowest-degree terms, i.e., by using partial sums. 


In this section, our goal is to investigate approximating a non-polynomial function f(x), by 
using a polynomial function p(x) and, if we want the approximation to be good for x near a, then 
we will use a polynomial which is centered at a. Why do this? Because polynomials are our 
friends; we can easily evaluate polynomials by hand (if we have to), because all that’s involved 
is raising to powers, multiplying by constants, and adding (or subtracting). We understand 
polynomials, and if we can use them to accurately approximate other functions, then we can 


say, in some serious sense, that we understand the functions being approximated. 


In this section, we will give the intuitive idea, and look at examples. In the next section, we 
will investigate how you can place some rigorous bounds on how good/bad the approximation 


may be. 


Suppose that we have two functions $f(x)$ and $p(x)$, which are both $n$ times differentiable,
for some integer $n \ge 0$. We’re thinking of $f(x)$ as a non-polynomial function, and of $p(x)$ as a
polynomial centered at $a$, but, for now, they could be any two functions which are differentiable
enough.


If we want $p(x)$ to provide a good approximation of $f(x)$, when $x$ is close to $a$, what should
we want to be true? Well, first of all, we can require that at $a$, $f$ and $p$ have the same value,
i.e., require that $f(a) = p(a)$; to make this fit in with what we will see later, we also say that $f$
and $p$ have the same 0-th derivative at $a$. This is equivalent to saying that the graphs of $f$ and
$p$ intersect at the point $(a, f(a)) = (a, p(a))$. As $f$ and $p$ are differentiable, they’re continuous,
and so, requiring them to be equal at $a$ forces them to not be too far apart if $x$ is close to $a$.
But can we do better?


How about if we continue to require f(a) = p(a), and also require that f and p are changing 
at the same rate at x = a? This certainly seems like it should force f and p to be close to 
each other for x near a. This new condition is simply that f’(a) = p’(a), and, combined with 
requiring that f(a) = p(a), means that the graphs of y = f(x) and y = p(x) have the same 
tangent lines at the point (a, f(a)) = (a, p(a)). 


But two graphs which intersect at a point, and have the same tangent line there, may not
have the same concavity. Wouldn’t requiring that the two graphs have the same tangent line
and concavity at the point $(a, f(a)) = (a, p(a))$ force the values of the two functions to be even
closer together when $x$ is close to $a$? This would mean that we would require $f''(a) = p''(a)$.


Hopefully, by now, you can see where we’re heading. The general principle (which doesn’t 
always work) is that, if we want for f(x) and p(x) to be really close to each other, when x is 
near a, then we could/should force f and p, and as many of their derivatives as we wish, to be 


equal at a. 


In Figure 4.1, the function whose graph is in black has the same 0-th and 1st derivatives at
$x = 1$ as the other three functions whose graphs appear in red, green, and blue, but only the
functions whose graphs are black and blue also have equal 2nd derivatives at 1. As you can see,
the black and blue graphs are very close together near $x = 1$.


Figure 4.1: Graphs of four functions with some matching derivatives at $x = 1$.


Now we want to assume that $f(x)$ is $n$ times differentiable at $a$, and we want to determine
a polynomial $p(x) = \sum_{k=0}^{n} c_k(x - a)^k$, centered at $a$, which approximates $f(x)$ well, when $x$ is
close to $a$. From our discussion above, we see that we want $f^{(k)}(a) = p^{(k)}(a)$, for all $k$ such that
$0 \le k \le n$. The good news is that this completely determines the coefficients $c_k$ of $p(x)$ in terms
of the derivatives of $f$ at $a$.


We will now derive a formula for the $c_k$, and the discussion will look just like that in
Example 4.1.7, and our conclusion will look very similar to Proposition 4.1.8, though we cannot
possibly have an equality of $f(x)$ and a polynomial approximation $p(x)$, if $f$ itself is not a
polynomial function.
Theorem 4.2.1. Suppose that $f(x)$ is a function which is $n$ times differentiable at $a$, and
that $p(x) = \sum_{k=0}^{n} c_k(x - a)^k$ is a polynomial, centered at $a$, such that, for all $k$ such that
$0 \le k \le n$, there is an equality of derivatives $f^{(k)}(a) = p^{(k)}(a)$. Then,

$$c_k = \frac{f^{(k)}(a)}{k!}.$$


Proof. The real proof of the formula is by mathematical induction, but the argument that we
give should be completely convincing.

We want to see that:

$$\text{if } p(x) = \sum_{k=0}^{n} c_k(x-a)^k, \text{ then } p^{(k)}(a) = (k!)\,c_k. \qquad (4.4)$$

For, then, requiring $f^{(k)}(a) = p^{(k)}(a)$ forces us to have the equality $f^{(k)}(a) = (k!)\,c_k$, and
so $c_k = f^{(k)}(a)/(k!)$.

So, we need to see why Formula 4.4 is true. We have

$$p(x) = c_0 + c_1(x-a) + c_2(x-a)^2 + c_3(x-a)^3 + c_4(x-a)^4 + \cdots + c_n(x-a)^n.$$

Hence,

$$p'(x) = c_1 + 2c_2(x-a) + 3c_3(x-a)^2 + 4c_4(x-a)^3 + \cdots + n\,c_n(x-a)^{n-1},$$

$$p''(x) = 2c_2 + (2\cdot 3)c_3(x-a) + (3\cdot 4)c_4(x-a)^2 + \cdots + [(n-1)n]\,c_n(x-a)^{n-2},$$

$$p'''(x) = (2\cdot 3)c_3 + (2\cdot 3\cdot 4)c_4(x-a) + \cdots + [(n-2)(n-1)n]\,c_n(x-a)^{n-3},$$

and so on. What you can see is that the constant term of $p^{(k)}(x)$ is $(k!)\,c_k$, and all of the other
terms, all of the positive powers of (x − a), are zero when x = a. Formula 4.4 follows, and we
are done.
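For readers who like to experiment, the claim that the constant term of the k-th derivative is (k!)c_k can be checked mechanically. The following is only a sketch: the coefficient list `c` is a made-up example, and the helper names are ours, not part of the text. A polynomial centered at a is stored as its list of coefficients in powers of (x − a), so evaluating at x = a just reads off the constant term.

```python
from math import factorial

def derivative_coeffs(coeffs):
    """Differentiate sum c_k (x-a)^k once; coefficients are listed by power of (x - a)."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def kth_derivative_at_center(coeffs, k):
    """Value of p^(k)(a): differentiate k times, then read off the constant term."""
    for _ in range(k):
        coeffs = derivative_coeffs(coeffs)
    return coeffs[0] if coeffs else 0

# Hypothetical coefficients c_0, ..., c_4, chosen only for illustration.
c = [5, -2, 7, 3, 11]
derivs_at_a = [kth_derivative_at_center(c, k) for k in range(len(c))]
expected = [factorial(k) * c[k] for k in range(len(c))]
print(derivs_at_a)  # p(a), p'(a), p''(a), ...
print(expected)     # (k!) * c_k for each k
```

The two printed lists should agree, which is exactly what Formula 4.4 asserts for this particular polynomial.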


We make the following definition.


Definition 4.2.2. Suppose that f(x) is a function which is n times differentiable at a.
Then, the polynomial

$$T_f^n(x;a) = \sum_{k=0}^{n} \frac{f^{(k)}(a)}{k!}(x-a)^k = f(a) + f'(a)(x-a) + \frac{f''(a)}{2!}(x-a)^2 + \frac{f'''(a)}{3!}(x-a)^3 + \cdots + \frac{f^{(n)}(a)}{n!}(x-a)^n$$

is called the n-th order Taylor polynomial of f, centered at a.

When a = 0, we write simply $T_f^n(x)$ in place of $T_f^n(x;0)$, and refer to the polynomial

$$T_f^n(x) = \sum_{k=0}^{n} \frac{f^{(k)}(0)}{k!}x^k = f(0) + f'(0)x + \frac{f''(0)}{2!}x^2 + \cdots + \frac{f^{(n)}(0)}{n!}x^n$$

as the n-th order Maclaurin polynomial of f.


Remark 4.2.3. We want to emphasize that, for all n, $T_f^n(a;a) = f(a)$; that is, the Taylor
polynomials always give the actual value of f at x = a.


Note that the n-th order Taylor polynomial will have degree n, provided that $f^{(n)}(a) \ne 0$; for
this reason, the n-th order Taylor polynomial is sometimes referred to as the degree n Taylor
polynomial, without worrying about whether or not $f^{(n)}(a)$ is zero.


The point of our discussion up to this point is that: 


Note that our previous discussion does not prove that, for n big enough, $T_f^n(x;a)$ approximates
f(x) well, for x near a; we will discuss bounds on the error in the approximation in the
next section. In the remainder of this section, we will calculate some specific Taylor/Maclaurin
polynomials, and compare the approximations that we get from them with the “actual values”
of f(x) that we get from calculators. This is somewhat circular, since modern computers and
scientific calculators, with floating point units, use polynomial approximation to give you highly
accurate results.


Remark 4.2.4. Note that the 1st order Taylor polynomial $T_f^1(x;a) = f(a) + f'(a)(x-a)$,
centered at a, is precisely the linearization of f at a, $L_f(x;a)$, which we discussed in [2]. Hence,
the approximation

$$f(x) \approx T_f^1(x;a)$$

is the linear approximation of f(x), when x is close to a.


As we shall see, using Taylor polynomials of order higher than 1 typically yields a better
approximation than you obtain by using merely the linearization.


Example 4.2.5. Suppose that $f(x) = e^x$. Let’s find $T_f^n(x)$, the n-th order Maclaurin polynomial
of f, and use the 1st and 3rd order Maclaurin polynomials to approximate $e^{0.1}$. We’ll
compare these approximations with the calculator value of $e^{0.1}$.


The coefficient $c_k$ of the $x^k$ term in $T_f^n(x)$ is $c_k = f^{(k)}(0)/(k!)$. So, we need all of the
derivatives of f(x) at 0. But all of the derivatives of $e^x$ are $e^x$, which has value 1 when x = 0.
Therefore, $c_k = 1/(k!)$, and the n-th order Maclaurin polynomial of $e^x$ is

$$T_f^n(x) = \sum_{k=0}^{n} \frac{x^k}{k!} = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots + \frac{x^n}{n!}.$$


In a way, this should be very unsurprising; in [2], we defined $\exp(x) = e^x$ to be the limit
of the polynomials

$$1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots + \frac{x^n}{n!}.$$


This leads to a general question which we will address in later sections: when is a function f(x)
the limit, as $n \to \infty$, of its Taylor polynomials $T_f^n(x)$?


For now, however, we just want to investigate how well $T_f^1(0.1)$ and $T_f^3(0.1)$ approximate
$e^{0.1}$ (where, in this section, we are going to rely on our calculators for highly accurate values).
We shall use an equality sign in quotes to denote “equal, according to the full precision of (our)
calculators”.


Your calculator should tell you that

$e^{0.1}$ “=” 1.10517091808,

and we find that

$$T_f^1(0.1) = 1 + 0.1 = 1.1,$$

while

$$T_f^3(0.1) = 1 + 0.1 + \frac{(0.1)^2}{2} + \frac{(0.1)^3}{6} \approx 1.1051666.$$


As you can see, the approximation by $T_f^1$ is fairly good, but the approximation by $T_f^3$ is
much better.
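If you would rather let a computer do the arithmetic, the comparison above can be reproduced with a few lines of Python. This is only a sketch; the function name `maclaurin_exp` is ours, and `math.exp` plays the role of the calculator.

```python
from math import exp, factorial

def maclaurin_exp(x, n):
    """n-th order Maclaurin polynomial of e^x: the sum of x^k / k! for k = 0..n."""
    return sum(x**k / factorial(k) for k in range(n + 1))

x = 0.1
t1 = maclaurin_exp(x, 1)
t3 = maclaurin_exp(x, 3)
# Each line shows the approximation and its distance from the library value of e^0.1.
print(t1, abs(exp(x) - t1))
print(t3, abs(exp(x) - t3))
```

Raising `n` further in `maclaurin_exp` is an easy way to watch the approximation settle toward the calculator value.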


Before we go on, it will be convenient to adopt some new notation. Rather than write the
n-th order Taylor polynomial for arbitrary n, it’s nice to use some notation which implies that
there is no particular n at which we need to stop. Thus, for example, when $f(x) = e^x$, rather
than writing that, for all n,

$$T_f^n(x) = \sum_{k=0}^{n} \frac{x^k}{k!} = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots + \frac{x^n}{n!},$$

we instead write

$$T_f(x) = T_f^\infty(x) = \sum_{k=0}^{\infty} \frac{x^k}{k!} = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots .$$


We use the notations $T_f(x)$ and $T_f^\infty(x)$ interchangeably. We are not claiming (yet!) that
this infinite sum defines a function. For now, it is formal notation that simply indicates, for
arbitrarily large n, what $T_f^n(x)$ is.


More generally, we make the definition:


Definition 4.2.6. A power series, centered at a, is a formal algebraic expression

$$\sum_{k=0}^{\infty} c_k(x-a)^k = c_0 + c_1(x-a) + c_2(x-a)^2 + c_3(x-a)^3 + \cdots ,$$

where the $c_k$ are constants.


Suppose that f(x) is a function which is infinitely differentiable at a, i.e., such that the
n-th derivative $f^{(n)}(a)$ exists for all n. Then, the power series $T_f(x;a) = T_f^\infty(x;a)$, centered
at a, given by

$$T_f(x;a) = T_f^\infty(x;a) = \sum_{k=0}^{\infty} \frac{f^{(k)}(a)}{k!}(x-a)^k = f(a) + f'(a)(x-a) + \frac{f''(a)}{2!}(x-a)^2 + \frac{f'''(a)}{3!}(x-a)^3 + \cdots$$

is the Taylor series of f, centered at a; when a = 0, this is also referred to as the
Maclaurin series of f.


Remark 4.2.7. Once we have the notion of a Taylor series, it is natural to think of the n-th 


order Taylor polynomial as the n-th order partial sum of the Taylor series. 


We also want to state again that, at this point, we have not defined power series functions. 
Just as there is a difference between a polynomial as an algebraic object and as a polynomial 
function, there is a difference between a power series as an algebraic object and as a function 


(assuming that we can even make sense of the infinite sum as a function). 


Example 4.2.8. Find the Taylor series for f(x) = sin x, centered at π/2, and use the first
two non-zero terms to estimate sin(1.5). Compare your estimate with the calculator value of
sin(1.5).


Solution: First, it is important that all angles are in radians, as is always the case in a Calculus
course, unless you are explicitly told otherwise.

The Taylor series for f(x), centered at π/2, is the power series

$$\sum_{k=0}^{\infty} c_k\left(x - \frac{\pi}{2}\right)^k, \quad \text{where } c_k = \frac{f^{(k)}(\pi/2)}{k!}.$$

Thus, as is always the case when finding Taylor series, the problem boils down to calculating
$f^{(k)}(a)$, which here is $f^{(k)}(\pi/2)$, for all k.


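The derivatives of sine repeat every four times you take a derivative:

$$f(x) = \sin x, \quad f'(x) = \cos x, \quad f''(x) = -\sin x, \quad f'''(x) = -\cos x, \quad f^{(4)}(x) = \sin x,$$

and, now that we’re back at sin x, the derivatives just keep cycling. Therefore, the derivatives
(including the 0-th), evaluated at a = π/2, are

$$1,\ 0,\ -1,\ 0,\ 1,\ 0,\ -1,\ 0,\ 1,\ \ldots$$

and so, the Taylor series, centered at π/2, is

$$T_f\left(x; \frac{\pi}{2}\right) = 1 - \frac{1}{2!}\left(x - \frac{\pi}{2}\right)^2 + \frac{1}{4!}\left(x - \frac{\pi}{2}\right)^4 - \frac{1}{6!}\left(x - \frac{\pi}{2}\right)^6 + \frac{1}{8!}\left(x - \frac{\pi}{2}\right)^8 - \cdots .$$

Hopefully, you can see the pattern here. We have only the even powers of $\left(x - \frac{\pi}{2}\right)$ and, for
each term, we divide by a factorial that matches the exponent, and the sign alternates between
+ and −.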


For many purposes, the formula above for $T_f\left(x; \frac{\pi}{2}\right)$, in which the pattern is clear, is good
enough. However, frequently, we want to write a formula for $c_k$, regardless of what k is, so that
we can write one “nice-looking” summation formula for the power series. But, in some cases,
like our current one, we don’t need to produce a formula for every $c_k$; we really only need a
formula for the coefficients of the non-zero terms. In this example, the non-zero terms are the
even-powered terms, and we would like a formula for $c_{2k}$, where k can be any integer ≥ 0, and
then we will use a summation that runs through only the even powers of $\left(x - \frac{\pi}{2}\right)$ in the first
place:

$$T_f\left(x; \frac{\pi}{2}\right) = \sum_{k=0}^{\infty} c_{2k}\left(x - \frac{\pi}{2}\right)^{2k} = c_0 + c_2\left(x - \frac{\pi}{2}\right)^2 + c_4\left(x - \frac{\pi}{2}\right)^4 + c_6\left(x - \frac{\pi}{2}\right)^6 + c_8\left(x - \frac{\pi}{2}\right)^8 + c_{10}\left(x - \frac{\pi}{2}\right)^{10} + \cdots .$$


So, now the question is: can we write a nice formula for $c_{2k}$? Let’s see, we have

$$c_{2\cdot 0} = c_0 = 1, \quad c_{2\cdot 1} = c_2 = -\frac{1}{2!}, \quad c_{2\cdot 2} = c_4 = \frac{1}{4!}, \quad c_{2\cdot 3} = c_6 = -\frac{1}{6!}.$$


We can see that the pattern is that

$$c_{2k} = \pm\frac{1}{(2k)!},$$

where the sign alternates from one even-subscripted coefficient to the next, but can we write
an algebraic formula for this alternating sign? Yes. The standard “trick” for this is to look at
$(-1)^k$ or $(-1)^{k+1} = (-1)^{k-1}$; for $(-1)^k$ equals 1 when k is even, and −1 when k is odd, while,
on the other hand, $(-1)^{k+1}$ equals 1 when k is odd, and −1 when k is even.


Therefore, looking at our coefficients, we see that we want

$$c_{2k} = \frac{(-1)^k}{(2k)!},$$

so that

$$T_f\left(x; \frac{\pi}{2}\right) = \sum_{k=0}^{\infty} \frac{(-1)^k}{(2k)!}\left(x - \frac{\pi}{2}\right)^{2k} = 1 - \frac{1}{2!}\left(x - \frac{\pi}{2}\right)^2 + \frac{1}{4!}\left(x - \frac{\pi}{2}\right)^4 - \frac{1}{6!}\left(x - \frac{\pi}{2}\right)^6 + \frac{1}{8!}\left(x - \frac{\pi}{2}\right)^8 - \frac{1}{10!}\left(x - \frac{\pi}{2}\right)^{10} + \cdots .$$

We were told to use the first two non-zero terms of the Taylor series to approximate sin(1.5).
Before we do this, we should answer the question: why would we expect this approximation to
be any good? The answer: because 3 is close to π, so that 3/2 = 1.5 is close to π/2. As our
x value, 1.5, is close to the center, we expect that the Taylor polynomials will provide a good
approximation.


The polynomial that we get from the first two non-zero terms of the Taylor series is

$$T_f^2\left(x; \frac{\pi}{2}\right) = 1 - \frac{1}{2}\left(x - \frac{\pi}{2}\right)^2,$$

which is the same as $T_f^3\left(x; \frac{\pi}{2}\right)$, since the cubed term is missing (i.e., has a coefficient of 0).


A calculator tells us that

$T_f^2\left(1.5; \frac{\pi}{2}\right) = 1 - \frac{1}{2}\left(1.5 - \frac{\pi}{2}\right)^2$ “=” 0.997493940056.


Comparing this with the calculator value of

sin(1.5) “=” 0.997494986604,

we see that the approximation by the 2nd order Taylor polynomial, centered at π/2, is accurate
(according to our calculators) to within 0.000001046548. Pretty good!
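The same check can be done in Python. The sketch below sums the first few non-zero terms of the series $\sum (-1)^k (x - \pi/2)^{2k}/(2k)!$; the function name and the `terms` parameter are our own choices, and `math.sin` stands in for the calculator.

```python
from math import factorial, pi, sin

def taylor_sin_half_pi(x, terms):
    """Sum of the first `terms` non-zero terms of the Taylor series of sin, centered at pi/2."""
    u = x - pi / 2
    return sum((-1)**k * u**(2 * k) / factorial(2 * k) for k in range(terms))

approx = taylor_sin_half_pi(1.5, 2)   # the two-term polynomial 1 - u^2/2
error = abs(sin(1.5) - approx)
print(approx, error)
```

Calling `taylor_sin_half_pi(1.5, 3)` adds the fourth-power term, which should shrink the error considerably.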


Let’s look at sin x again, but center now at 0.


Example 4.2.9. Find the Maclaurin series for f(x) = sin x, and use the first two non-zero
terms to estimate sin(0.1) and sin(1.5). Compare your estimates with the calculator values.


Solution:


There is no question what we need to do; if f(x) = sin x, then

$$T_f(x) = \sum_{k=0}^{\infty} \frac{f^{(k)}(0)}{k!}x^k,$$

and our problem is to find $f^{(k)}(0)$ and, ideally, to find a nice formula for at least the non-zero
derivatives at 0.

We saw in the previous example that the derivatives of sine repeat:

$$f(x) = \sin x, \quad f'(x) = \cos x, \quad f''(x) = -\sin x, \quad f'''(x) = -\cos x, \quad f^{(4)}(x) = \sin x,$$

and so, the derivatives (including the 0-th), evaluated at a = 0, are

$$0,\ 1,\ 0,\ -1,\ 0,\ 1,\ 0,\ -1,\ 0,\ 1,\ 0,\ -1,\ \ldots$$

and so, the Maclaurin series for sin x is

$$T_f(x) = 0 + 1\cdot x + 0\cdot x^2 - \frac{1}{3!}x^3 + 0\cdot x^4 + \frac{1}{5!}x^5 + 0\cdot x^6 - \frac{1}{7!}x^7 + \cdots .$$


We should remark that the fact that sine is an odd function and that the Maclaurin series
for sin x has only odd-powered (non-zero) terms is not a coincidence; it is an exercise for you to
show that, evaluated at 0, the even derivatives of odd functions are zero, and the odd derivatives
of even functions are zero.


Though we shall not need it, we leave it to you to verify that

$$T_f(x) = \sum_{k=0}^{\infty} (-1)^k \frac{x^{2k+1}}{(2k+1)!}.$$


The first two non-zero terms of $T_f(x)$ correspond to the 3rd order Maclaurin polynomial,
$T_f^3(x)$, which, in fact, is the same as $T_f^4(x)$.


Hence, we were told to use the approximation

$$\sin x \approx x - \frac{x^3}{6}$$

when x = 0.1 and when x = 1.5.

When x = 0.1, we find that

$T_f^3(0.1) = 0.1 - \frac{(0.1)^3}{6} \approx 0.0998333$, and sin(0.1) “=” 0.099833416647,

which is a very close approximation.

On the other hand, when x = 1.5, we have

$T_f^3(1.5) = 1.5 - \frac{(1.5)^3}{6} = 0.9375$, and sin(1.5) “=” 0.997494986604,

which is not a particularly close approximation.


It should not be surprising that the approximation when x = 1.5 is not as good here, or that
the approximation when x = 1.5 in the previous example, where we were centered at π/2, was
much better. The value x = 0.1 is relatively close to 0, while 1.5 is not. On the other hand,
1.5 is very close to π/2.
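To see the effect of the distance from the center side by side, here is a small Python sketch (the function name `t3_sin` is ours) that evaluates the 3rd order Maclaurin polynomial x − x³/6 at both points and prints the gap to the library value of sine.

```python
from math import sin

def t3_sin(x):
    """3rd order Maclaurin polynomial of sin: x - x^3/6."""
    return x - x**3 / 6

for x in (0.1, 1.5):
    # columns: x, polynomial value, library value of sin x, absolute difference
    print(x, t3_sin(x), sin(x), abs(sin(x) - t3_sin(x)))
```

The difference at x = 1.5 should come out several orders of magnitude larger than the difference at x = 0.1.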


Example 4.2.10. If x is close to 0, then 6(x − sin x) is best approximated by what power of x?


Solution:


In fact, we cannot really justify our answer to this until we have some bound on the error
in the Taylor approximation, which we won’t look at until the next section.


However, we expect that, when x is close to 0, f(x) = sin x is close to its Maclaurin polynomials,
which we can read off of the Maclaurin series from the previous example. Hence, we
expect that

$$\sin x \approx x - \frac{x^3}{3!} + \frac{x^5}{5!} - \frac{x^7}{7!} + \cdots + (-1)^{n+1}\frac{x^{2n-1}}{(2n-1)!}.$$

Then, performing some easy algebra, we expect that

$$x - \sin x \approx \frac{x^3}{3!} - \frac{x^5}{5!} + \frac{x^7}{7!} - \cdots + (-1)^{n}\frac{x^{2n-1}}{(2n-1)!}$$

and

$$6(x - \sin x) \approx x^3 - \frac{6x^5}{5!} + \frac{6x^7}{7!} - \cdots + (-1)^{n}\frac{6x^{2n-1}}{(2n-1)!}.$$

When x is close to 0, we know that the $x^3$ term on the right is the most important; the other
terms will be extremely close to 0.


Therefore, when x is close to 0, we expect that 6(x − sin x) is best approximated by $x^3$.
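One informal way to test this expectation numerically is to look at the ratio of 6(x − sin x) to x³ for a few small values of x; if the cubic power is the right one, the ratio should drift toward 1. The sketch below does this. The sample points are arbitrary choices of ours, and a numerical experiment like this is evidence, not a proof.

```python
from math import sin

def ratio(x):
    """Ratio of 6(x - sin x) to x^3; values near 1 mean x^3 is a good approximation."""
    return 6 * (x - sin(x)) / x**3

# Sample points are illustrative only; smaller x should give a ratio closer to 1.
ratios = [ratio(x) for x in (0.5, 0.1, 0.01)]
print(ratios)
```

Very tiny values of x (say, below 1e-5) are best avoided here, because the subtraction x − sin x then loses most of its floating point precision.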


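We conclude this section by listing a few important Maclaurin series, some of which we’ve
derived, and some of which we leave as exercises.


Theorem 4.2.11. If $f(x) = e^x$, then

$$T_f(x) = \sum_{k=0}^{\infty} \frac{x^k}{k!} = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots .$$

If f(x) = sin x, then

$$T_f(x) = \sum_{k=0}^{\infty} (-1)^k \frac{x^{2k+1}}{(2k+1)!} = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \frac{x^7}{7!} + \cdots .$$

If f(x) = cos x, then

$$T_f(x) = \sum_{k=0}^{\infty} (-1)^k \frac{x^{2k}}{(2k)!} = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \cdots .$$

If f(x) = 1/(1 − x), then

$$T_f(x) = \sum_{k=0}^{\infty} x^k = 1 + x + x^2 + x^3 + \cdots .$$

If f(x) = ln(1 + x), then

$$T_f(x) = \sum_{k=0}^{\infty} (-1)^k \frac{x^{k+1}}{k+1} = x - \frac{x^2}{2} + \frac{x^3}{3} - \frac{x^4}{4} + \cdots .$$


Remark 4.2.12. Note that sin x is an odd function, and that its Maclaurin series contains only
odd-powered terms, while cos x is an even function, and that its Maclaurin series contains only
even-powered terms. These are not coincidences.


In Exercise 37, we outline how you show that the Maclaurin series of an odd function contains
only odd powers of x, and that the Maclaurin series of an even function contains only even powers
of x.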


Remark 4.2.13. Because we ended with a list of Maclaurin series, you may have gotten the 
impression that Taylor/Maclaurin series were the most important topic in this section. This is 
not the case. We introduced Taylor and Maclaurin series here simply as a convenience; referring 
to the series enables us to omit any reference to the final term in a Taylor polynomial, and to 


leave out the phrase “for all n”. 


The main point of this section is simple: the n-th order Taylor polynomial, centered
at a, of f(x) should approximate f(x) very well, when x is close to a and n is big.


4.2.1 Exercises


In Exercises 1 through 5, (a) use the five Maclaurin series given in Theorem 4.2.11 to
determine the 1st, 2nd, and 3rd order Maclaurin polynomials of the given function
f, (b) using a calculator, compare the values of f(2) with the values of $T_f^1(2)$, $T_f^2(2)$,
and $T_f^3(2)$, and explain why your results are not surprising, and (c) using a calculator,
compare the values of f(0.001) with the values of $T_f^1(0.001)$, $T_f^2(0.001)$, and $T_f^3(0.001)$,
and explain why your results are not surprising.


w 
SB OH SH BR OS 
oh 
lI 
° 
ie} 
mn 
8 


In each of Exercises 6 through 10, (a) determine the 2nd order Taylor polynomial,
centered at the given a, for the given function, (b) using a graphing calculator or
computer graphing program, graph, in a single window, the original function and
$T_f^2(x;a)$, for values of x “close” to a.


6. p(x) = (« — 1)? + (2 — 1)? + (x - 1), a=0. 


7. g(x) = sinh x, a = 0.


8. h(x) = cosh x, a = 0.


10. r(x) = (14+ z)*, a=3. 


In each of Exercises 11 through 13, find the 3rd order Taylor polynomial, $T_f^3(x;a)$,
for the given f and the given a.


ll. f(x) = VYa,a=1. 


12. f(z) = V1+2?,a=0. 


1. fle) Sloe) a= 1, eo

In Exercises 14 through 18, (a) calculate the specified $T_f^n(x;a)$ and (b) the error
between the Taylor approximation and the value given by your calculator.


14. f(x) = sin x.

a. $T_f^3\left(4.7; \frac{3\pi}{2}\right)$

b. $\left|\sin(4.7) - T_f^3\left(4.7; \frac{3\pi}{2}\right)\right|$


15. g(x) = sinh x.

a. $T_g^3(0.1; 0)$

b. $|\sinh(0.1) - T_g^3(0.1; 0)|$


16. h(x) = ln x.

a. $T_h^2(0.9; 1)$

b. $|\ln(0.9) - T_h^2(0.9; 1)|$


17. v(x) = sec x.

a. $T_v^2(-0.1; 0)$

b. $|\sec(-0.1) - T_v^2(-0.1; 0)|$


18. w(x) = cos x.

a. $T_w^2\left(1.6; \frac{\pi}{2}\right)$

b. $\left|\cos(1.6) - T_w^2\left(1.6; \frac{\pi}{2}\right)\right|$


19. What power of x do you expect to approximate sin x well, when x is near 0? That is, if
you want to approximate sin x by a function $x^n$, for x values close to 0, what should n be?


20. What power of x do you expect to approximate 6(x − sin x) well, when x is near 0? That
is, if you want to approximate 6(x − sin x) by a function $x^n$, for x values close to 0, what
should n be?


21. If you want to approximate 1 − cos x well, by a function of the form $cx^n$, when x is close
to 0, what should you pick for the constant c and the positive integer n?


22. Using a graphing calculator or graphing software, graph f(x) = 1 − cos x and $g(x) = cx^n$,
using your c and n values from the previous exercise. Graph the functions in the same
window, with −1 ≤ x ≤ 1 and −0.5 ≤ y ≤ 0.5.




In each of Exercises 23 through 33, determine the Taylor series for the given func- 


tion, centered at the given a. Give the Taylor series using summation notation, or 


by giving the first 5 non-zero terms, followed by dots. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


sce" aa L 
u(t) = sin t + cos t, a = π/2.


k(t) = cos(2t), a = π/4.


k(x) = cosh(2x), a = 0. 
r(t) = sinh(2t), a = 0. 


f(t) =In(1 +32), a =3. 


Describe in a paragraph or two the idea behind why the Taylor polynomials of f, centered 


at a, “should” approximate f(x) well for values of x close to a. 


What is T?'(x;a) — smc a)? Assume f is infinitely differentiable in a neighborhood 


of r=a. 


The approximation sin x ≈ x, when x is close to 0, is used often in physics and engineering.


Where does this come from? 


If you wanted a better approximation of sinz, when z is close to 0, and were willing to 
use a polynomial with two non-zero terms (instead of one), what polynomial would you 
use? Why? 


In this exercise, we give the steps which show that the Maclaurin series of an even or odd
function contains only even or, respectively, odd powers of x.


a. Suppose that f is an odd function, which is defined at 0. Show that f(0) = 0. 


b. Show that the derivative of an even function is odd, and that the derivative of an odd 
function is even. Conclude that, for infinitely differentiable functions, even-numbered 
derivatives of odd functions are odd, and odd-numbered derivatives of even functions 


are odd.

c. Show that, if f is infinitely differentiable and defined at 0, then the only (possible)
non-zero terms in the Maclaurin series of f are the even-powered terms if f is even,
and the odd-powered terms if f is odd.


38. Derive the Maclaurin series for cos x given in Theorem 4.2.11.

39. Derive the Maclaurin series for ln(1 + x) given in Theorem 4.2.11.

40. Derive the Maclaurin series for 1/(1 − x) given in Theorem 4.2.11.


4.3. Error in Approximation by Polynomials


In the previous section, we introduced Taylor polynomials, centered at a, of a function f(x). 


We tried to convince you, on an intuitive level, that the n-th order Taylor polynomial should 
approximate the original function well for values of x near the center when n is large, because 
we forced n derivatives of f and the Taylor polynomial to match at a. In this section, we will 


actually give a result on the error involved, and show how to apply it. 


As in Section we wish to give names and notations for the remainder and error when 


approximating a function by its Taylor polynomials. 


Definition 4.3.1. Suppose that f is n times differentiable at a.
Then, we let

$$R_f^n(x;a) = f(x) - T_f^n(x;a)$$

and refer to it as the n-th order Taylor remainder for f, centered at a.


We let $E_f^n(x;a) = |R_f^n(x;a)|$, and refer to it as the n-th order Taylor error for f,
centered at a.


If a = 0, we naturally refer to these as the n-th order Maclaurin remainder and
Maclaurin error for f.


Remark 4.3.2. As $T_f^n(a;a) = f(a)$ for all n, the n-th order Taylor remainder and error are
both zero at x = a, i.e., $E_f^n(a;a) = |R_f^n(a;a)| = 0$.


We now state and prove the one result that we shall use to establish upper bounds on the
error in approximating a function by its Taylor polynomials: the Taylor-Lagrange Theorem.
The theorem is a generalization of the Mean Value Theorem for derivatives. Various forms of
this result are known simply as Taylor’s Theorem. The result that we give is actually Taylor’s
Theorem with Lagrange’s form of the remainder, and so we refer to the theorem as the
Taylor-Lagrange Theorem.

The condition in the Taylor-Lagrange Theorem that x ≠ a is present because the theorem
refers to a number c that’s in the open interval between x and a. This does not mean that we
don’t know what happens when x = a; remember: when x = a, we have already seen that all of
the Taylor remainders are 0.


Theorem 4.3.3. (Taylor-Lagrange Theorem) Suppose that n is a non-negative integer,
and that the (n + 1)-st derivative of f exists on an open interval I around a point a.


Then, for all x ≠ a in I, there exists a c in the open interval between x and a (i.e., in
(x, a) if x < a, or in (a, x) if a < x) such that

$$R_f^n(x;a) = \frac{f^{(n+1)}(c)}{(n+1)!}(x-a)^{n+1}.$$


In particular, for all x in I (even if x = a), there exists c in the closed interval between
x and a such that

$$E_f^n(x;a) = \frac{|f^{(n+1)}(c)|}{(n+1)!}\,|x-a|^{n+1}.$$


Proof. This proof is technical. However, as this result is the entire point of this section, we will
give the proof here, rather than relegating it to an appendix.


Assume that x ≠ a. Define the function g(t), with domain I, by

$$g(t) = f(x) - R_f^n(x;a)\,\frac{(x-t)^{n+1}}{(x-a)^{n+1}} - \sum_{k=0}^{n} \frac{f^{(k)}(t)}{k!}(x-t)^k.$$


Since f is (n + 1)-times differentiable on I, g is differentiable on I, and so is certainly continuous on
the closed interval between x and a. One easily sees that g(x) = g(a) = 0. By Rolle’s Theorem
(see [2]), there exists c in the open interval between x and a such that g'(c) = 0.


We claim that this shows that $R_f^n(x;a) = \frac{f^{(n+1)}(c)}{(n+1)!}(x-a)^{n+1}$, as desired, but we need to
calculate g'(t) in order to see this.


We find, using the Product Rule inside the summation,

$$g'(t) = -R_f^n(x;a)\,\frac{(n+1)(x-t)^n(-1)}{(x-a)^{n+1}} - \sum_{k=0}^{n} \frac{1}{k!}\left[f^{(k+1)}(t)(x-t)^k + f^{(k)}(t)\,k(x-t)^{k-1}(-1)\right] =$$

$$\frac{(n+1)R_f^n(x;a)}{(x-a)^{n+1}}(x-t)^n - \sum_{k=0}^{n} \frac{f^{(k+1)}(t)}{k!}(x-t)^k + \sum_{k=1}^{n} \frac{f^{(k)}(t)}{(k-1)!}(x-t)^{k-1}.$$


Let j = k − 1, so that k = j + 1, and use j to re-index the last summation above. We obtain

$$g'(t) = \frac{(n+1)R_f^n(x;a)}{(x-a)^{n+1}}(x-t)^n - \sum_{k=0}^{n} \frac{f^{(k+1)}(t)}{k!}(x-t)^k + \sum_{j=0}^{n-1} \frac{f^{(j+1)}(t)}{j!}(x-t)^j.$$


Now, in these two summations above, one with a plus sign, one with a minus sign, all of the
terms cancel out, except for one: $-f^{(n+1)}(t)(x-t)^n/(n!)$.


Therefore, we find that

$$g'(t) = \frac{(n+1)R_f^n(x;a)}{(x-a)^{n+1}}(x-t)^n - \frac{f^{(n+1)}(t)}{n!}(x-t)^n.$$


Since g'(c) = 0 and c is in the open interval between x and a, we know that x − c ≠ 0 and so,
we conclude that

$$R_f^n(x;a) = \frac{f^{(n+1)}(c)}{(n+1)!}(x-a)^{n+1},$$

which is what we needed to show.


Remark 4.3.4. It shouldn’t be too difficult to remember the conclusion of the Taylor-Lagrange
Theorem; the n-th remainder

$$R_f^n(x;a) = \frac{f^{(n+1)}(c)}{(n+1)!}(x-a)^{n+1}$$

looks exactly like the (n + 1)-st order term in the Taylor series for f, except that there is this
unknown c stuck into the (n + 1)-st derivative instead of the center a.


As we shall see, to use this form of the remainder, we always need to determine an upper
bound on the absolute value of $f^{(n+1)}(c)$, an upper bound that does not contain a reference to
the mysterious c.

You should note that the value of c can and, probably, will change when n changes and, for
this reason, we could denote c by the more careful notation $c_n$. However, as our goal is always
to place an upper bound on $|f^{(n+1)}(c)|$ which doesn’t have the c in it, there is no practical need
to write $c_n$ in place of c.
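To make the "mysterious c" a little less mysterious, here is a sketch in Python for one case where c can actually be solved for. We assume, purely for illustration, that f(x) = e^x, the center is a = 0, and x = 0.1; then the remainder formula reads R = e^c x^(n+1)/(n+1)!, which can be solved for c with a logarithm. The function name is ours, and `math.exp` is trusted for the value of e^x.

```python
from math import exp, factorial, log

def lagrange_c_for_exp(x, n):
    """Solve R = e^c * x^(n+1) / (n+1)! for c, where R = e^x - T^n(x) and the center is 0."""
    taylor = sum(x**k / factorial(k) for k in range(n + 1))
    remainder = exp(x) - taylor
    return log(remainder * factorial(n + 1) / x**(n + 1))

c1 = lagrange_c_for_exp(0.1, 1)
c3 = lagrange_c_for_exp(0.1, 3)
print(c1, c3)
```

Both values should land strictly between 0 and 0.1, as the theorem promises, and they should differ from each other, which is the reason one could write $c_n$ instead of c.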


Example 4.3.5. In Example 4.2.5, we looked at Maclaurin polynomials for $f(x) = e^x$. We saw
that $T_f^1(0.1) = 1.1$ and $T_f^3(0.1) \approx 1.1051666$, and we decided to trust our calculators, which told
us that $e^{0.1}$ “=” 1.10517091808.


What if we don’t want to rely on our calculator? Can we use the Taylor-Lagrange Theorem
to obtain upper bounds on the error if we approximate $e^{0.1}$ by the 1st and 3rd order
Maclaurin polynomials? We could also ask what order Maclaurin polynomial we would need to
use, according to the Taylor-Lagrange Theorem, to obtain the 11 decimal place accuracy of our
calculators.


Certainly $e^x$ is infinitely differentiable on the entire real line, so the hypothesis of the
Taylor-Lagrange Theorem about being differentiable (n + 1) times on an open interval I around a holds
for all a and for $I = (-\infty, \infty)$.


Therefore, the error in approximating $e^{0.1}$ by the n-th order Maclaurin polynomial $T_f^n(x)$ is
given by

$$E_f^n(0.1) = |R_f^n(0.1)| = \left|\frac{f^{(n+1)}(c)}{(n+1)!}(0.1)^{n+1}\right| = \frac{(0.1)^{n+1}}{(n+1)!}\,|f^{(n+1)}(c)|,$$

for some c in the closed interval [0, 0.1]. Of course, all of the derivatives of $e^x$ are just $e^x$. Now,
$f^{(n+1)}(c) = e^c$, which is always positive, and so we have

$$E_f^n(0.1) = \frac{(0.1)^{n+1}\,e^c}{(n+1)!},$$

for some c in the closed interval [0, 0.1]. But what do we do with this $e^c$ factor, when we don’t
know what c is?


The quick answer: we say that the $e^c$ factor is less than or equal to something which has no
c in it. This means that we won’t know the exact error, but will be able to say that the error
is no more than a certain amount, which is what we really need in most applications.

Since c is in the closed interval [0, 0.1], and $f(x) = e^x$ is an increasing function (since its
derivative $e^x$ is always positive), regardless of what c is, we know that $e^c \le e^{0.1}$. Thus, we know

$$E_f^n(0.1) \le \frac{(0.1)^{n+1}\,e^{0.1}}{(n+1)!}.$$


At this point, your reaction may be something like “But...but...our whole goal was to find a good
approximation of $e^{0.1}$, and now our upper bound on the error contains $e^{0.1}$. Isn’t this circular?”


No, because we can now use some fairly awful upper bound, that doesn’t have to be
very accurate, for $e^{0.1}$, and then conclude how big n needs to be so that our gross upper bound
guarantees that $E_f^n(0.1)$ is really small. So, no, our argument will not be “circular”, but will seem
close to it; we will use a gross upper bound for $e^{0.1}$ to obtain arbitrarily good approximations
of $e^{0.1}$. Cool, huh?


How does this work? We will use that $e^{0.1} < 2$. Why is this true? Because certainly $e < 2^{10}$!
So, now we have that

$$E_f^n(0.1) \le \frac{(0.1)^{n+1}\,e^{0.1}}{(n+1)!} \le \frac{(0.1)^{n+1}\cdot 2}{(n+1)!}. \qquad (4.5)$$


Note that our upper bound on the error decreases as n increases, so that using higher-order 
Maclaurin polynomials gives a smaller upper bound on the error in the approximation. This is 


what we want and expect. 
Now we’re ready to answer the questions that we asked at the beginning of this section. 


When n = 1, Formula 4.5 tells us that

$$E_f^1(0.1) \le \frac{(0.1)^{1+1}\cdot 2}{(1+1)!} = 0.01.$$


In fact, what we really found before, trusting our calculators, is that

$E_f^1(0.1) = |e^{0.1} - T_f^1(0.1)|$ “=” |1.10517091808 − 1.1| = 0.00517091808.


Thus, the actual error is roughly half of what we produced for an upper bound, but what’s
important is that the error is less than or equal to our upper bound of 0.01, as it had to
be. It is not terribly surprising that our upper bound is “far” from the actual error; we used a
rough upper bound for $e^c$, and then we used that $e < 2^{10}$.


The point is that it is more important to end up with a manageable upper bound
than to end up with a more accurate, but unmanageable, upper bound.


When n = 3, Formula 4.5 tells us that

$$E_f^3(0.1) \le \frac{(0.1)^{3+1}\cdot 2}{(3+1)!} = \frac{0.0002}{24} \approx 0.000008333.$$


Of course, once again trusting our calculators, you should check that the “actual” error is roughly
half of this.


Finally, if we were the ones setting up your calculator with a Maclaurin polynomial to approximate
$e^{0.1}$ to within 11 decimal places, what order Maclaurin polynomial would Formula 4.5
tell us is good enough? That is, what’s the smallest n such that our upper bound on the error
$E_f^n(0.1)$ is less than $5(10)^{-12}$, so that the digit in the 12th decimal place is less than 5, and
rounding will give the “correct” answer?


We want the smallest n such that

$$\frac{(0.1)^{n+1}\cdot 2}{(n+1)!} < 5(10)^{-12}.$$


The bad news is that there are no algebraic rules that will let you solve for such an n. The
good news is that if n is 11, $(0.1)^{n+1} = (10)^{-12}$ and 2/(12!) is much less than 1, so n = 11 is
definitely big enough, but how do we find the smallest such n? The easy way: just check the
other n values less than 11, either by hand or with a calculator. You can start at n = 0 and go
up, or start at n = 11 and go down.


We leave it as an exercise for you to show that the answer is n = 7. Hence, the 7th order
Maclaurin polynomial for $e^x$ can be used to approximate $e^{0.1}$ to 11 decimal place accuracy.


It is worth noting that n = 7 is good enough, and it’s the smallest n that makes our upper
bound less than $5(10)^{-12}$, but, since our upper bound is not, itself, the actual error (probably),
it is possible that some more difficult, or different, argument would tell us a smaller n so that
$E_f^n(0.1)$ is less than $5(10)^{-12}$. But we’re happy with knowing that n = 7 forces our relatively
easy upper bound on the error to be less than $5(10)^{-12}$, which, in turn, forces the error itself to
be less than $5(10)^{-12}$.
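The "just check the n values" step is easy to hand to a computer. The following Python sketch starts at n = 0 and goes up until the upper bound from Formula 4.5 drops below the target; the function name `error_bound` and the parameter `cap` (our name for the crude bound 2 on the exponential factor) are our own.

```python
from math import factorial

def error_bound(n, x=0.1, cap=2.0):
    """Upper bound cap * x^(n+1) / (n+1)! from Formula 4.5 (cap = 2 bounds e^0.1)."""
    return cap * x**(n + 1) / factorial(n + 1)

target = 5e-12
n = 0
while error_bound(n) >= target:   # start at n = 0 and go up
    n += 1
print(n, error_bound(n))
```

Changing `target` or `cap` shows how sensitive the required order is to the accuracy demanded and to the quality of the crude bound.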


Example 4.3.6. Suppose that x is within ±0.2 of π/2. If we use the 4th order Taylor polynomial
of f(x) = sin x, centered at π/2, to approximate sin x, how accurate can we be certain that our
approximation is?


Solution:


As in the last example, we will find a gross upper bound on the error $E_f^4(x; \pi/2)$, and then
see how big this upper bound is. We shall first look at the situation for general n, and then
discuss what happens when n = 4; there is something special which happens in this example,
something which did not occur in the previous example.


The Taylor-Lagrange Theorem tells us that the error $E_f^n(x; \pi/2)$ in approximating sin x by
$T_f^n(x; \pi/2)$ is given by

$$E_f^n(x; \pi/2) = |R_f^n(x; \pi/2)| = \left|\frac{f^{(n+1)}(c)}{(n+1)!}\left(x - \frac{\pi}{2}\right)^{n+1}\right| = \frac{\left|x - \frac{\pi}{2}\right|^{n+1}}{(n+1)!}\,|f^{(n+1)}(c)|,$$

where c is between x and π/2, and so is also within ±0.2 of π/2.


We were told that x is within ±0.2 of π/2; this means precisely that |x − π/2| ≤ 0.2. Thus,
we know that

$$E_f^n(x; \pi/2) \le \frac{(0.2)^{n+1}}{(n+1)!}\,|f^{(n+1)}(c)|.$$


As in all such problems, we need to find an upper bound on $|f^{(n+1)}(c)|$. How do we do this?
The manner by which you produce such an upper bound varies from function to function, but,
this time, is particularly easy.


All of the derivatives of sine are equal to plus or minus sine or cosine, i.e., $f^{(n+1)}(c)$ is one
of ± sin c or ± cos c. As the values of sine and cosine are always between −1 and 1, the absolute
values of plus or minus sine or cosine are all less than or equal to 1. Hence, $|f^{(n+1)}(c)| \le 1$, and

$$E_f^n(x; \pi/2) \le \frac{(0.2)^{n+1}}{(n+1)!}\,|f^{(n+1)}(c)| \le \frac{(0.2)^{n+1}}{(n+1)!}.$$

When n = 4, we find

$$E_f^4(x; \pi/2) \le \frac{(0.2)^{4+1}}{(4+1)!} \approx 0.000002666.$$

Recall that we found the Taylor series for sin x, centered at π/2, in Example 4.2.8; we found

$$T_f\left(x; \frac{\pi}{2}\right) = 1 - \frac{1}{2!}\left(x - \frac{\pi}{2}\right)^2 + \frac{1}{4!}\left(x - \frac{\pi}{2}\right)^4 - \frac{1}{6!}\left(x - \frac{\pi}{2}\right)^6 + \cdots . \qquad (4.6)$$

Thus,

$$T_f^4\left(x; \frac{\pi}{2}\right) = 1 - \frac{1}{2}\left(x - \frac{\pi}{2}\right)^2 + \frac{1}{24}\left(x - \frac{\pi}{2}\right)^4,$$

which has only three non-zero terms. It’s pretty amazing that these three non-zero polynomial
terms let us approximate sin x to within ±0.000002666, provided that we know that x is within
±0.2 of π/2.


However, by being slightly tricky, we can actually conclude that the error is substantially smaller. Since all of the odd-powered terms are zero in the Taylor series above, $T_f^4(x;\pi/2) = T_f^5(x;\pi/2)$, which implies that $E_f^4(x;\pi/2) = E_f^5(x;\pi/2)$. Now, our upper bound on $E_f^5(x;\pi/2)$ says that

$$E_f^4(x;\pi/2) = E_f^5(x;\pi/2) \le \frac{(0.2)^6}{6!} \approx 0.0000000888.$$


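So, $T_f^4(x;\pi/2)$ yields a stunningly accurate approximation of $\sin x$ for $x$ in the given range. For instance, when $x = 0.1 + \pi/2$, a calculator tells us that

$$\sin\left(0.1 + \frac{\pi}{2}\right) \approx 0.995004165278,$$

while

$$T_f^4\left(0.1 + \frac{\pi}{2};\ \frac{\pi}{2}\right) = 1 - \frac{1}{2!}(0.1)^2 + \frac{1}{4!}(0.1)^4 \approx 0.995004166666.$$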
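
As a quick numerical check of this example, the following Python sketch evaluates the Lagrange bound $(0.2)^{n+1}/(n+1)!$ for $n = 4$ and $n = 5$ and compares it with the actual error at $x = 0.1 + \pi/2$. The function names `lagrange_bound` and `taylor4_sin` are our own labels for illustration; they are not notation from the text.

```python
import math

def lagrange_bound(radius, n, deriv_bound=1.0):
    """Upper bound M * r^(n+1) / (n+1)! on the n-th order Taylor error,
    where M bounds |f^(n+1)| and r bounds |x - center|."""
    return deriv_bound * radius ** (n + 1) / math.factorial(n + 1)

def taylor4_sin(x):
    """4th-order Taylor polynomial of sin, centered at pi/2."""
    h = x - math.pi / 2
    return 1 - h**2 / 2 + h**4 / 24

x = 0.1 + math.pi / 2
actual_error = abs(math.sin(x) - taylor4_sin(x))

print("bound for n = 4:", lagrange_bound(0.2, 4))   # about 2.67e-06
print("bound for n = 5:", lagrange_bound(0.2, 5))   # about 8.89e-08
print("actual error   :", actual_error)
```

The actual error at this particular $x$ comes out well below both bounds, which is what we should expect: the bounds must hold for every $x$ within 0.2 of $\pi/2$, not just this one.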


Example 4.3.7. Let $f(x) = \cos(3x)$.

a. Suppose that $|x| \le 0.1$. Find a “reasonable” upper-bound on the error in approximating $f(x)$ by $T_f^3(x;0)$.

b. Suppose that $|x| \le 0.1$. By using “reasonable” bounds, find the smallest $n$ such that you can prove that the error in approximating $f(x)$ by $T_f^n(x;0)$ is less than or equal to 0.0001.

c. Suppose that we want $T_f^3(x;0)$ to approximate $f(x)$ to within 0.0001. By using “reasonable” bounds, find the largest $\delta > 0$ such that this accuracy will be achieved provided that $|x| \le \delta$.


Solution: 


All of these involve finding an expression for the Lagrange form of the remainder for $T_f^n(x;0)$, and then producing a “reasonable” upper-bound for its absolute value.

We calculate the derivatives:

$$f'(x) = -3\sin(3x),\quad f''(x) = -3^2\cos(3x),\quad f'''(x) = 3^3\sin(3x),\quad f^{(4)}(x) = 3^4\cos(3x),\ \ldots.$$

It is easy to see that $f^{(n+1)}(x)$ equals $\pm 3^{n+1}\cos(3x)$ or $\pm 3^{n+1}\sin(3x)$.

Therefore, if $c$ is between $x$ and 0 (actually, regardless of what $c$ is), $|f^{(n+1)}(c)| \le 3^{n+1}$, and so the error $E_f^n(x;0)$ in approximating $f(x)$ by $T_f^n(x;0)$ satisfies the inequality

$$E_f^n(x;0) = \left|R_f^n(x;0)\right| = \left|\frac{f^{(n+1)}(c)}{(n+1)!}\,x^{n+1}\right| \le \frac{3^{n+1}}{(n+1)!}\,|x|^{n+1}. \tag{4.7}$$


a. Suppose that $|x| \le 0.1$. Then, the inequality in Formula 4.7 yields

$$E_f^n(x;0) \le \frac{3^{n+1}}{(n+1)!}\,|x|^{n+1} \le \frac{3^{n+1}}{(n+1)!}\,(0.1)^{n+1}.$$

Hence, $E_f^3(x;0)$, the error in approximating $f(x)$ by $T_f^3(x;0)$, is less than or equal to

$$\frac{3^4}{4!}(0.1)^4 = 0.0003375.$$


b. We suppose again that $|x| \le 0.1$ and, hence, as above, we know that

$$E_f^n(x;0) \le \frac{3^{n+1}}{(n+1)!}\,(0.1)^{n+1}.$$

We want to find the smallest $n$ so that this is less than or equal to 0.0001.

Notice that, from part (a), we know that, when $n = 3$, $\frac{3^{n+1}}{(n+1)!}(0.1)^{n+1} = 0.0003375$, which is bigger than we want. So, we just start checking $n$'s, starting at $n = 4$, and calculate until we find one so that

$$\frac{3^{n+1}}{(n+1)!}\,(0.1)^{n+1} \le 0.0001.$$

We were close at $n = 3$, so it sure seems like $n = 4$ or $n = 5$ will work. When $n = 4$, we find

$$\frac{3^5}{5!}(0.1)^5 = 0.00002025 \le 0.0001.$$

Thus, $n = 4$ is good enough.


c. We use that

$$E_f^3(x;0) \le \frac{3^4}{4!}\,|x|^4,$$

and require this to be less than or equal to 0.0001.

We want

$$\frac{3^4}{4!}\,|x|^4 \le 0.0001,$$

and so, we need

$$|x| \le \left(\frac{4!}{3^4}\,(0.0001)\right)^{1/4} \approx 0.073778794646688.$$
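
Parts (b) and (c) are easy to check by machine. The sketch below, in Python, searches for the smallest $n$ for which the bound $\frac{3^{n+1}}{(n+1)!}(0.1)^{n+1}$ drops to 0.0001 or less, and solves the part (c) inequality for the radius. The names `bound`, `smallest_order`, and `largest_radius` are hypothetical helpers introduced only for this illustration.

```python
import math

def bound(n, radius=0.1):
    """Bound 3^(n+1) * radius^(n+1) / (n+1)! on the error for cos(3x)."""
    return 3 ** (n + 1) * radius ** (n + 1) / math.factorial(n + 1)

def smallest_order(tol=1e-4, radius=0.1):
    """Smallest n whose bound is at most tol (checked one n at a time)."""
    n = 0
    while bound(n, radius) > tol:
        n += 1
    return n

def largest_radius(n=3, tol=1e-4):
    """Largest delta with 3^(n+1) * delta^(n+1) / (n+1)! <= tol."""
    return (math.factorial(n + 1) * tol / 3 ** (n + 1)) ** (1 / (n + 1))

print(smallest_order())   # the order found in part (b)
print(largest_radius())   # the radius found in part (c)
```

Note that this only confirms the answers that follow from our chosen bound; a sharper bound on the derivative could, in principle, give a smaller $n$ or a larger $\delta$.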


Let’s try another one. 


Example 4.3.8. Let $f(x) = e^{-x}$.

a. Suppose that $|x - 1| \le 0.1$. Find a “reasonable” upper-bound on the error in approximating $f(x)$ by $T_f^3(x;1)$.

b. Suppose that $|x - 1| \le 0.1$. By using “reasonable” bounds, find the smallest $n$ such that you can prove that the error in approximating $f(x)$ by $T_f^n(x;1)$ is less than or equal to 0.0001.

c. Suppose that we want $T_f^3(x;1)$ to approximate $f(x)$ to within 0.0001. By using “reasonable” bounds, find the largest $\delta > 0$ such that this accuracy will be achieved provided that $|x - 1| \le \delta$.
Solution:

As in the previous example, all of these involve finding an expression for the Lagrange form of the remainder for $T_f^n(x;1)$, and then producing a “reasonable” upper-bound for its absolute value.

You should be able to find quickly that $f^{(n+1)}(x) = (-1)^{n+1}e^{-x}$. Therefore, $|f^{(n+1)}(c)| = e^{-c}$, and so the error $E_f^n(x;1)$ in approximating $f(x)$ by $T_f^n(x;1)$ satisfies

$$E_f^n(x;1) = \left|R_f^n(x;1)\right| = \left|\frac{f^{(n+1)}(c)}{(n+1)!}\,(x-1)^{n+1}\right| = \frac{e^{-c}}{(n+1)!}\,|x-1|^{n+1}, \tag{4.8}$$

for some $c$ between $x$ and 1, inclusive.


a. Suppose that $|x - 1| \le 0.1$. This means that $-0.1 \le x - 1 \le 0.1$, and so, $0.9 \le x \le 1.1$. As $c$ is between $x$ and 1, it follows that $c$ itself is trapped in the same interval, i.e., that $0.9 \le c \le 1.1$. We need an upper bound on $e^{-c}$.

Be careful! As a function of $c$, $e^{-c}$ is decreasing, because of the negative sign. This means that smaller values of $c$ yield larger values of $e^{-c}$. Hence, we use that $e^{-c} \le e^{-0.9}$.

Then, Formula 4.8 yields

$$E_f^n(x;1) \le \frac{e^{-0.9}}{(n+1)!}\,|x-1|^{n+1} \le \frac{e^{-0.9}}{(n+1)!}\,(0.1)^{n+1}.$$

If the $e^{-0.9}$ in this upper-bound seems too ugly, you can use another “reasonable” upper-bound, and write that, since $e^{-0.9} \le 1$,

$$E_f^n(x;1) \le \frac{1}{(n+1)!}\,(0.1)^{n+1}.$$

Using this nicer upper-bound, we find that $E_f^3(x;1)$, the error in approximating $f(x)$ by $T_f^3(x;1)$, is less than or equal to

$$\frac{1}{4!}(0.1)^4 \approx 0.0000041666.$$
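b. We suppose again that $|x - 1| \le 0.1$ and, hence, as above, we know that

$$E_f^n(x;1) \le \frac{1}{(n+1)!}\,(0.1)^{n+1}.$$

We want to find the smallest $n$ so that this is less than or equal to 0.0001. You can easily check that $n = 3$ is the smallest $n$ that works for this bound: when $n = 2$, the bound is $\frac{1}{3!}(0.1)^3 \approx 0.000167$, which is too big, while, when $n = 3$, it is the value $0.0000041666$ from part (a).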


c. This is more difficult than part (c) in the previous example. Suppose that $|x - 1| \le \delta$. Then, as in part (a), where $\delta$ was 0.1, we find that $1 - \delta \le x \le 1 + \delta$, and so the $c$ in the Lagrange form of the remainder also satisfies $1 - \delta \le c \le 1 + \delta$.

Now, Formula 4.8 tells us that

$$E_f^n(x;1) = \frac{e^{-c}}{(n+1)!}\,|x-1|^{n+1},$$

where $|x - 1| \le \delta$ and, as before, $e^{-c} \le e^{-(1-\delta)}$. We conclude that

$$E_f^n(x;1) \le \frac{e^{-(1-\delta)}}{(n+1)!}\,\delta^{n+1}.$$

What we want is, when $n = 3$, to find a $\delta$ that's as big as we can “reasonably” produce, in order to make the last quantity above less than or equal to 0.0001. That is, we want to know how big we can choose $\delta > 0$ so that

$$\frac{e^{-(1-\delta)}}{24}\,\delta^4 \le 0.0001. \tag{4.9}$$

Unfortunately, there is no nice algebra that we can do to “solve” the above inequality for an optimally large $\delta$.

However, we just want some “reasonably large” $\delta$ that makes the inequality true. And so, we can just treat different factors separately; for instance, we can look for a $\delta > 0$ such that

$$e^{-(1-\delta)} \le 1 \quad\text{and}\quad \frac{\delta^4}{24} \le 0.0001.$$

These, together, would certainly imply the desired inequality in Formula 4.9.

The good news is that we can “solve” the two inequalities above. We find that we would like a $\delta > 0$ such that

$$\delta \le 1 \quad\text{and}\quad \delta \le \sqrt[4]{(24)(0.0001)} = 0.1\sqrt[4]{24} < 0.222.$$

Of course, requiring $\delta \le 0.1\sqrt[4]{24}$ already implies that $\delta \le 1$. Thus, choosing $\delta = 0.1\sqrt[4]{24}$ would be “reasonable”.
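
Although Formula 4.9 cannot be solved for $\delta$ by algebra, it is easy to explore numerically. The following Python sketch checks that $\delta = 0.1\sqrt[4]{24}$ satisfies the inequality, and then uses bisection to approximate the largest $\delta$ in $[0, 1]$ that does. Bisection is our own choice of method here, not something the text prescribes, and the helper names `bound` and `largest_delta` are hypothetical.

```python
import math

def bound(delta, n=3):
    """Left side of Formula 4.9: e^{-(1 - delta)} * delta^(n+1) / (n+1)!."""
    return math.exp(-(1 - delta)) * delta ** (n + 1) / math.factorial(n + 1)

def largest_delta(tol=1e-4, lo=0.0, hi=1.0, steps=60):
    """Bisection on [lo, hi]; valid because bound() increases with delta
    and bound(lo) <= tol < bound(hi)."""
    for _ in range(steps):
        mid = (lo + hi) / 2
        if bound(mid) <= tol:
            lo = mid   # mid still satisfies the inequality
        else:
            hi = mid   # mid is too large
    return lo

delta_reasonable = 0.1 * 24 ** 0.25
print(delta_reasonable, bound(delta_reasonable))   # bound is below 0.0001
print(largest_delta())                             # a somewhat larger delta
```

The numerically optimal $\delta$ is larger than the “reasonable” one, as expected, since replacing $e^{-(1-\delta)}$ by 1 throws away some room in the inequality.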


4.3.1 Exercises 


1. Let f(x) = sin (5). 


a. Suppose that |z| < 0.1. Find a “reasonable” upper-bound on the error in approxi- 
mating f(a) by T?(a;0). 
b. Suppose that |x| < 0.1. By using “reasonable” bounds, find the smallest n such that 


you can prove that the error in approximating f(a) by T? (2; 0) is less than or equal 
to 0.0001. 


c. Suppose that we want T? (a; 0) to approximate f(x) to within 0.0001. By using 
“reasonable” bounds, find the largest 6 > 0 such that this accuracy will be achieved 
provided that || < 6. 


2. Let f(x) = e?*. 
a. Suppose that |z| < 0.1. Find a “reasonable” upper-bound on the error in approxi- 
mating f(a) by T?(a;0). 
b. Suppose that |x| < 0.1. By using “reasonable” bounds, find the smallest n such that 


you can prove that the error in approximating f(a) by T? (2; 0) is less than or equal 
to 0.0001. 


c. Suppose that we want T? (a; 0) to approximate f(x) to within 0.0001. By using 
“reasonable” bounds, find the largest 6 > 0 such that this accuracy will be achieved 
provided that |x| < 6. 


3. Let f(a) = sin(5z). 


a. Suppose that |a—1| < 0.1. Find a “reasonable” upper-bound on the error in approx- 
imating f(x) by T? (2; 1). 
b. Suppose that $|x - 1| \le 0.1$. By using “reasonable” bounds, find the smallest $n$ such that you can prove that the error in approximating $f(x)$ by $T_f^n(x;1)$ is less than or equal to 0.0001.


c. Suppose that we want T? (2; 1) to approximate f(a) to within 0.0001. By using 
“reasonable” bounds, find the largest 6 > 0 such that this accuracy will be achieved 
provided that |a — 1] < 6. 


4. Let f(z) =e”. 


a. Suppose that |a—1| < 0.1. Find a “reasonable” upper-bound on the error in approx- 
imating f(a) by T? (a; 1). 
b. Suppose that | — 1| < 0.1. By using “reasonable” bounds, find the smallest n such 


that you can prove that the error in approximating f(x) by T?(zx;1) is less than or 
equal to 0.0001. 


c. Suppose that we want T?(a;1) to approximate f(a) to within 0.0001. By using 
“reasonable” bounds, find the largest 6 > 0 such that this accuracy will be achieved 
provided that |a — 1] < 6. 


5. Let f(x) =Ing. 


a. Suppose that |a—1| < 0.1. Find a “reasonable” upper-bound on the error in approx- 
imating f(a) by T? (a; 1). 
b. Suppose that |z — 1] < 0.1. By using “reasonable” bounds, find the smallest n such 


that you can prove that the error in approximating f(x) by T?(a;1) is less than or 
equal to 0.0001. 


c. Suppose that we want T? (2; 1) to approximate f(x) to within 0.0001. By using 
“reasonable” bounds, find the largest 6 > 0 such that this accuracy will be achieved 
provided that |a — 1] < 6. 


6. Let f(a) = cos (=). 


a. Suppose that |a—1| < 0.1. Find a “reasonable” upper-bound on the error in approx- 
imating f(a) by T3 (a; Ty: 
b. Suppose that |a — 1] < 0.1. By using “reasonable” bounds, find the smallest n such 


that you can prove that the error in approximating f(x) by T?(zx;1) is less than or 
equal to 0.0001. 


c. Suppose that we want T? (2; 1) to approximate f(a) to within 0.0001. By using 
“reasonable” bounds, find the largest 6 > 0 such that this accuracy will be achieved 
provided that |a — 1] < 6. 
7. Let f(z) =2°.
a. Suppose that | +1] < 0.1. Find a “reasonable” upper-bound on the error in approx- 
imating f(a) by T} (a; —1). 
b. Suppose that | + 1] < 0.1. By using “reasonable” bounds, find the smallest n such 


that you can prove that the error in approximating f(x) by T Hi (a; —1) is less than or 
equal to 0.0001. 


c. Suppose that we want T? (2; —1) to approximate f(x) to within 0.0001. By using 
“reasonable” bounds, find the largest 6 > 0 such that this accuracy will be achieved 
provided that |x| < 6. 


8. Let f(z) = 2". 
a. Suppose that |2+ 1] < 0.1. Find a “reasonable” upper-bound on the error in approx- 
imating f(a) by T? (a; —1). 
b. Suppose that |a + 1] < 0.1. By using “reasonable” bounds, find the smallest n such 


that you can prove that the error in approximating f(x) by T? (x; —1) is less than or 
equal to 0.0001. 


c. Suppose that we want T? (a; —1) to approximate f(x) to within 0.0001. By using 
“reasonable” bounds, find the largest 6 > 0 such that this accuracy will be achieved 
provided that |x| < 6. 


9. Let f(a) =x —sing. 


a. What is T?(a; 0)? 
b. Suppose that |a| <1. Find a “reasonable” upper-bound on the error in approximating 
f(x) by T? (2; 0). 
10. Let f(z) =1-— cosa. 


a. What is T?(a;0)? 
b. Suppose that |a| < 1. Find a “reasonable” upper-bound on the error in approximating 
f(x) by T? (2; 0). 


Recall that the Extreme Value Theorem (see [2]) tells us that a continuous function defined on a closed interval is bounded. Combining this with the Taylor-Lagrange Theorem, it follows that, if $f^{(n+1)}$ is continuous on the closed interval $[a - \delta, a + \delta]$, then there exists a constant $M$ such that $|R_f^n(x;a)| \le \frac{M}{(n+1)!}\,|x - a|^{n+1}$ for all $x$ in that interval.

Use this result in Exercises 11 through 16 to find an upper bound on $E_f^n(x;0) = |R_f^n(x;0)|$ where $x$ is within $\pm\delta$ of zero; this amounts to finding the maximum of $|f^{(n+1)}(c)|$ for $c$ in the interval $[-\delta, \delta]$.
11. $f(x) = e^x$, $\delta = 1$.
12. $f(x) = \cos x$, $\delta = 0.3$.
13. $f(x) = \sin x$, $\delta = 0.3$.
14. f(x) =n 4+2),6=0.1. 
15. $f(x) = \cosh x$, $\delta = 1$.
16. $f(x) = \sinh x$, $\delta = 1$.
17. If $f$ is an infinitely differentiable function, what is $E_f^n(a;a)$?
18. Explain why the Mean Value Theorem is a special case of the Taylor-Lagrange Theorem. 
R*(a;4 
19. Prove that if f+ is continuous at x = a, then lim ee = 
wa xX—-a 
20. a. Suppose that $k \ge 0$, and that $f^{(k+1)}(a)$ exists. Use the actual definition of the remainder (and the derivative), and iterate l'Hôpital's Rule (see [2]) to prove that
$$\lim_{x\to a}\frac{R_f^k(x;a)}{(x-a)^{k+1}} = \frac{1}{(k+1)!}\,f^{(k+1)}(a).$$
(Hint: To apply l'Hôpital's Rule, you will need to use that the existence of $f^{(k+1)}(a)$ implies that all lower-order derivatives of $f$ exist for all $x$ in an open interval around $a$.)
b. Prove a stronger (and harder) result than that in the previous exercise; prove that, 
Ri (a;4 
if f+ (a) exists, then lim a) a 
a—a| (a —a)* 
21. In this exercise, you will reprove the Second Derivative Test (see [2]). Suppose that $f'(a) = 0$ and that $f''(a)$ exists.
a. Use the results of the previous exercise to show that $\displaystyle\lim_{x\to a}\frac{f(x) - f(a)}{(x-a)^2} = \frac{1}{2}f''(a)$.
b. Suppose that $f''(a) > 0$. Conclude that $f$ attains a local minimum value at $x = a$.
c. Suppose that $f''(a) < 0$. Conclude that $f$ attains a local maximum value at $x = a$.
22. Use the approach in the previous problem to prove the following Third Derivative Test. If $f^{(3)}(a)$ exists and is non-zero, $f'(a) = 0$, and $f''(a) = 0$, then $f$ attains neither a local maximum nor a local minimum value at $x = a$.
23. Prove that, if $k$ is even and $f'(a) = f''(a) = \cdots = f^{(k-1)}(a) = 0$, but $f^{(k)}(a) \ne 0$, then $f$ attains a local minimum (respectively, maximum) value at $x = a$ if $f^{(k)}(a) > 0$ (respectively, $f^{(k)}(a) < 0$).

24. Prove that, given the same hypotheses as the previous problem except that $k$ is odd, $f$ attains neither a local maximum nor a local minimum value at $x = a$.
25. Let $f(x) = e^x$. For part (a), use the fact, proved in [2], that $e < 3$.

a. Show that $0 < R_f^k(1;0) < \dfrac{3}{(k+1)!}$.

b. Estimate the value of $e$ to within 0.001 of the correct value.


26. In this problem, we will prove that $e$ is irrational. We will do this by contradiction; so, assume that $e$ is rational and is equal to $p/q$, where $p$ and $q$ are both positive integers. We wish to derive a contradiction from this assumption. Let $f(x) = e^x$.

a. Use part (b) of the previous problem to argue that $q > 2$.

b. Show that, since we're assuming that $e = p/q$, it follows that
$$\frac{p}{q} = 1 + 1 + \frac{1}{2!} + \cdots + \frac{1}{q!} + R_f^q(1;0),$$
where $0 < R_f^q(1;0) < \dfrac{3}{(q+1)!}$ by the previous exercise.

c. Multiply both sides of the equation in part (b) by $q!$. Reach a contradiction by rearranging the resulting equation and arguing that one side is an integer and the other side is not.


In the next exercise, you’ll prove a different version of Taylor’s Theorem; you will 


prove Taylor’s Theorem with Integral Remainder. 


27. Suppose that $f$ is continuously differentiable. Then, the Fundamental Theorem of Calculus, Theorem 2.4.10, tells us that $\int_a^x f'(t)\,dt = f(x) - f(a)$. Also, recall the formula for integration by parts, Theorem 1.19, in the context of definite integration:

$$\int_a^b h(y)\,g'(y)\,dy = h(b)g(b) - h(a)g(a) - \int_a^b h'(y)\,g(y)\,dy.$$

a. Apply integration by parts to $\int_a^x f'(t)\,dt$ with $h(t) = f'(t)$ and $g(t) = t - x$ to obtain the formula

$$f(x) = f(a) + f'(a)(x - a) + \int_a^x (x - t)\,f''(t)\,dt,$$

provided that $f$ possesses a continuous second derivative.

b. Apply integration by parts a second time to obtain the formula

$$f(x) = f(a) + f'(a)(x - a) + \frac{1}{2}f''(a)(x - a)^2 + \frac{1}{2}\int_a^x (x - t)^2\,f'''(t)\,dt,$$

provided that $f$ possesses a continuous third derivative.

c. Argue inductively that

$$f(x) = f(a) + f'(a)(x - a) + \frac{1}{2}f''(a)(x - a)^2 + \cdots + \frac{1}{n!}f^{(n)}(a)(x - a)^n + \frac{1}{n!}\int_a^x (x - t)^n\,f^{(n+1)}(t)\,dt,$$

provided that $f$ possesses a continuous $(n+1)$-th derivative. Thus, if $f$ possesses a continuous $(n+1)$-th derivative,

$$R_f^n(x;a) = \frac{1}{n!}\int_a^x (x - t)^n\,f^{(n+1)}(t)\,dt.$$

This is Taylor's Theorem with Integral Remainder.
28. Suppose that $f^{(n+1)}$ exists and is continuous. Fix an $a$ and an $x$.

a. Explain why there exists $M \ge 0$ such that $|f^{(n+1)}(t)| \le M$ for all $t$ in the closed interval between $a$ and $x$.

b. Use the $M$ from part (a) and Taylor's Theorem with Integral Remainder to show that $|R_f^n(x;a)| \le \dfrac{M}{(n+1)!}\,|x - a|^{n+1}$. Note that we obtained this same bound prior to Exercise 11, by using Lagrange's form of the remainder.
4.4 Functions as Power Series 


In the previous two sections, we first looked informally at approximating functions by their Taylor polynomials, and then we got formal and used the Taylor-Lagrange Theorem to put bounds on the error in the Taylor approximations. Along the way, as a convenient notational device, we introduced Taylor series; these are formal algebraic objects, which are like polynomials, centered at $a$, except they don't (necessarily) stop. We did not say, in those earlier sections, that Taylor series define functions.

In this section, we will discuss when a Taylor series $T_f^\infty(x;a)$ defines a function which is equal to the original function $f(x)$, for, at least, some values of $x$ close to the center $a$. It is common to say that the original function can be represented by its Taylor series. We will then give examples/theorems involving familiar functions.

Note that this section deals with when the Taylor series $T_f^\infty(x;a)$ of a given function $f$ defines a function in its own right, and when the function $T_f^\infty(x;a)$ equals the original function $f$. This section can be viewed as an introduction to using power series as functions. Results on functions that are initially defined as power series, or manipulations of power series functions, are dealt with in Section 4.5 and later sections.


Suppose that $f(x)$ is infinitely differentiable at a point $a$. Then, for all $n$, we have defined the $n$-th order Taylor polynomial

$$T_f^n(x;a) = \sum_{k=0}^{n}\frac{f^{(k)}(a)}{k!}\,(x - a)^k.$$

Of course, the Taylor polynomials also give us Taylor polynomial functions and, in the previous sections, we looked at the error $E_f^n(x;a)$ in approximating $f(x)$ by using the functions $T_f^n(x;a)$.

We also defined the formal algebraic Taylor series:

$$T_f^\infty(x;a) = \sum_{k=0}^{\infty}\frac{f^{(k)}(a)}{k!}\,(x - a)^k,$$

which was really just a way of indicating that we weren't thinking of looking at just one Taylor polynomial, but rather were interested in looking at what happens as we take arbitrarily high-order Taylor polynomials.


Our question now is: can we somehow think of the infinite sum in the Taylor series 
as defining a function, and can we say when such a Taylor series function is equal 


to the original function f(x)? 


Since this is a Calculus book, limits may/should come to mind; we can define the infinite 
Taylor sum to be the limit, as n approaches infinity, of the n-th order partial sums, i.e., of the 
n-th order Taylor polynomials. (Note that, here, the limit uses only integer values for n; thus, 
it is the limit of a sequence, as defined in [2], and as we shall see again in Definition 4.5.1.)


Of course, we can make this definition; the question is: does such a definition give us 


anything useful? As we shall see, the answer is “yes”, and so we make a definition. 


Definition 4.4.1. Suppose that $f$ is infinitely differentiable at $a$.

Then, the value, at a point $x_0$, of the Taylor series of $f$, centered at $a$, is

$$T_f^\infty(x_0;a) = \lim_{n\to\infty} T_f^n(x_0;a) = \lim_{n\to\infty}\left[f(a) + f'(a)(x_0 - a) + \frac{f''(a)}{2!}(x_0 - a)^2 + \frac{f'''(a)}{3!}(x_0 - a)^3 + \cdots + \frac{f^{(n)}(a)}{n!}(x_0 - a)^n\right],$$

provided that this limit exists; in this case, we say that the Taylor series $T_f^\infty(x;a)$ of $f$, centered at $a$, converges at $x_0$.

If the limit fails to exist, we say that $T_f^\infty(x;a)$ diverges at $x_0$.

The Taylor series function of $f$, centered at $a$, is the function whose domain is the set of $x_0$ at which $T_f^\infty(x;a)$ converges, and its value at any $x_0$ in its domain is $T_f^\infty(x_0;a)$, as defined above. The codomain is taken to be all real numbers. The notation for this function is the same as that of the algebraic Taylor series: $T_f^\infty(x;a)$.


Usually, the Taylor series function is referred to simply as the Taylor series, and the context 
makes it clear whether we mean as a formal algebraic object or as a function; this is why there 


is not separate notation for the algebraic Taylor series and the Taylor series function. 
Since we expect the Taylor polynomials of $f$ to approximate $f$ well, near the center, and for the approximation to get better as we take higher-order Taylor polynomials, what we hope/expect is that, if a Taylor series converges at some $x_0$, then what it converges to is $f(x_0)$, i.e., we expect that $f(x_0) = T_f^\infty(x_0;a)$. In other words, we expect, as functions of $x$, that $f(x) = T_f^\infty(x;a)$, at least for $x$ values near $a$. Surprisingly, this is not always true (see Example 4.4.11).


However, something that’s not surprising is the following theorem, which tells us that it’s all 


a question of whether the remainder approaches 0 or not. 


Theorem 4.4.2. Suppose that $f$ is infinitely differentiable at $a$. Then, for a given $x$, $\lim_{n\to\infty} E_f^n(x;a) = 0$ if and only if $T_f^\infty(x;a)$ converges to $f(x)$, i.e., if and only if

$$f(x) = T_f^\infty(x;a).$$

Proof. As $E_f^n(x;a) = |R_f^n(x;a)|$, the equality $\lim_{n\to\infty} E_f^n(x;a) = 0$ is equivalent to $\lim_{n\to\infty} R_f^n(x;a) = 0$, which means precisely that

$$\lim_{n\to\infty}\left[f(x) - T_f^n(x;a)\right] = 0,$$

i.e.,

$$f(x) = \lim_{n\to\infty} T_f^n(x;a) = T_f^\infty(x;a).$$


Remark 4.4.3. We should make three comments.

First, we usually omit the explicit assumption in Theorem 4.4.2 that $f$ is infinitely differentiable at $a$; this omission causes no confusion since, when we give a statement involving $T_f^\infty(x;a)$, it is clear that we are, in fact, assuming that the formal algebraic series $T_f^\infty(x;a)$ exists, i.e., that $f$ is infinitely differentiable at $a$.

Second, the way in which we usually show that $E_f^n(x;a) \to 0$ is to find some upper bound $U_f^n(x;a)$ on $E_f^n(x;a)$, and then show that $U_f^n(x;a) \to 0$ as $n \to \infty$. This implies that $E_f^n(x;a) \to 0$, since

$$0 \le E_f^n(x;a) \le U_f^n(x;a),$$

and $E_f^n(x;a)$ gets pinched to 0 as $U_f^n(x;a) \to 0$.

Finally, it is, of course, always true that $T_f^\infty(x;a)$ converges to $f(x)$ at the single point $x = a$, for then all of the $(x - a)$'s are 0. What's interesting is not when $T_f^\infty(x;a)$ converges to $f(x)$ at just $x = a$, but rather when $T_f^\infty(x;a)$ converges to $f(x)$ for all $x$ near $a$.


Before we look at some examples, we will state an easy lemma, which is useful, in various problems, in showing that $\lim_{n\to\infty} E_f^n(x;a) = 0$.

Lemma 4.4.4. Suppose that $x$ is a fixed real number.

1. Suppose that $|x| < 1$. Then, $\lim_{n\to\infty} |x|^n = 0$.

2. Regardless of the size of $|x|$, $\displaystyle\lim_{n\to\infty}\frac{|x|^n}{n!} = 0$.

Proof. Item 1 is easy. If $|x| = 0$, there is nothing to show. So, suppose $0 < |x| < 1$. Let $\epsilon > 0$. If $\epsilon \ge 1$, then, for all $n \ge 1$, $|x|^n < \epsilon$. If $0 < \epsilon < 1$, then $\ln|x|$ and $\ln\epsilon$ are both negative, and selecting $n > (\ln\epsilon)/(\ln|x|)$ implies that $|x|^n < \epsilon$. This proves the limit statement in Item 1.

To show Item 2, suppose that $n_0$ is the first natural number greater than $|x|$. Then, for all $n \ge n_0$,

$$0 \le \frac{|x|^n}{n!} = \frac{|x|^{n_0-1}}{(n_0-1)!}\cdot\frac{|x|}{n_0}\cdot\frac{|x|}{n_0+1}\cdot\frac{|x|}{n_0+2}\cdots\frac{|x|}{n} \le \frac{|x|^{n_0-1}}{(n_0-1)!}\left(\frac{|x|}{n_0}\right)^{n-n_0+1},$$

where $|x|/n_0 < 1$. By Item 1, this last factor on the right approaches 0 as $n$ approaches infinity and, hence, so does $|x|^n/(n!)$, since it gets pinched between 0 and a quantity that approaches 0.
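
Item 2 of the lemma can feel surprising when $|x|$ is large, since $|x|^n$ itself blows up. The short Python sketch below computes $|x|^n/n!$ as a running product of the factors $|x|/k$, which mirrors the factorization used in the proof; the function name `power_over_factorial` is a hypothetical helper of ours.

```python
def power_over_factorial(x, n):
    """Compute |x|^n / n! as the product of |x|/k for k = 1..n.
    Multiplying factor by factor avoids forming |x|^n or n! directly."""
    term = 1.0
    for k in range(1, n + 1):
        term *= abs(x) / k
    return term

# With x = 10, the terms grow at first (while k < 10) and then shrink
# once each new factor 10/k is below 1.
for n in (5, 10, 20, 50, 100):
    print(n, power_over_factorial(10, n))
```

For $x = 10$, the first natural number greater than $|x|$ is $n_0 = 11$, and from that point on every additional factor is less than 1, exactly as in the proof.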
Now we are ready to look at our first example of using Theorem 4.4.2. All of our examples in this section are important enough to state as theorems.

The following result is actually circular; if you look back in [2], you'll see that we defined $e^x = \exp(x)$ using what we now call its Maclaurin series. Nonetheless, it is a good warm-up exercise for us.


Theorem 4.4.5. For all $x$, the function $e^x$ is equal to its Maclaurin series function, i.e., there is an equality of functions

$$e^x = \sum_{k=0}^{\infty}\frac{x^k}{k!} = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \cdots.$$

Proof. Let $f(x) = e^x$. We will use the Lagrange form of the remainder, together with Theorem 4.4.2.

Fix a particular value $x_0$ of $x$. We have that

$$0 \le E_f^n(x_0) = \left|\frac{f^{(n+1)}(c)}{(n+1)!}\,x_0^{n+1}\right| = \left|f^{(n+1)}(c)\right|\cdot\frac{|x_0|^{n+1}}{(n+1)!}$$

for some $c$ between 0 and $x_0$. As $n$ goes to $\infty$, $n+1$ goes to $\infty$, and so Lemma 4.4.4 tells us that the factor on the right, above, $|x_0|^{n+1}/(n+1)!$, approaches 0 as $n$ approaches $\infty$. We would like to know that the entire term

$$\left|f^{(n+1)}(c)\right|\cdot\frac{|x_0|^{n+1}}{(n+1)!}$$

approaches 0, for then the non-negative quantity $E_f^n(x_0)$ would be pinched to 0. The question is: what do we do about the $|f^{(n+1)}(c)|$ factor?

Since $f(x) = e^x$, $f^{(n+1)}(x) = e^x$, and so $f^{(n+1)}(c) = e^c$. As $e^x$ is an increasing function, if $x_0 < 0$, so that $x_0 \le c \le 0$, then $e^c \le e^0 = 1$, and if $x_0 \ge 0$, so that $0 \le c \le x_0$, then $e^c \le e^{x_0}$.

In either case, $e^c$ is less than or equal to a fixed value, 1 or $e^{x_0}$, and so, since $|x_0|^{n+1}/(n+1)! \to 0$, the non-negative quantity

$$\left|f^{(n+1)}(c)\right|\cdot\frac{|x_0|^{n+1}}{(n+1)!} \to 0$$

as $n \to \infty$, and we are finished.
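
To see the pinching in this proof numerically, the Python sketch below compares the actual error of the Maclaurin partial sums of $e^x$ with the bound $\max(1, e^{x_0})\cdot|x_0|^{n+1}/(n+1)!$ from the proof. The helper names are ours, and we use the library function `math.exp` as the reference value of $e^x$, which is an assumption of convenience rather than part of the argument.

```python
import math

def maclaurin_exp(x, n):
    """n-th order Maclaurin polynomial of e^x, summed term by term."""
    term, total = 1.0, 1.0
    for k in range(1, n + 1):
        term *= x / k          # term is now x^k / k!
        total += term
    return total

def error_bound(x, n):
    """Bound from the proof: max(1, e^x) * |x|^(n+1) / (n+1)!."""
    return max(1.0, math.exp(x)) * abs(x) ** (n + 1) / math.factorial(n + 1)

for x0 in (-3.0, 2.0):
    for n in (5, 10, 20):
        actual = abs(math.exp(x0) - maclaurin_exp(x0, n))
        print(x0, n, actual, error_bound(x0, n))
```

In every row, the actual error should sit at or below the bound (up to floating-point rounding), and both columns shrink rapidly as $n$ grows.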
Remark 4.4.6. Understand the difference between the result of Theorem 4.4.5 and the result of Theorem 4.2.11: if $f(x) = e^x$, then, in Theorem 4.2.11, we stated that the formal Maclaurin series was

$$T_f^\infty(x) = \sum_{k=0}^{\infty}\frac{x^k}{k!} = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \cdots.$$

There was no claim, at that time, that the Maclaurin series defined a function, or that, even if it did define a function, that function had to be $e^x$; these two conclusions are what we proved in Theorem 4.4.5.


We also discussed, back in [2], that sine and cosine are equal to their Maclaurin series (though we didn't phrase it that way). It is important here that $\sin x$ and $\cos x$ mean that $x$ is interpreted as being in radians. In a way, these series help explain why radians are “more natural”, mathematically, than degrees; these series are not correct if the $x$ to which you apply sine and cosine is measured in degrees.


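Theorem 4.4.7. For all $x$, the functions $\sin x$ and $\cos x$ are equal to their Maclaurin series, i.e., there are equalities of functions

$$\sin x = \sum_{k=0}^{\infty}(-1)^k\frac{x^{2k+1}}{(2k+1)!} = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \frac{x^7}{7!} + \cdots$$

and

$$\cos x = \sum_{k=0}^{\infty}(-1)^k\frac{x^{2k}}{(2k)!} = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \cdots.$$

Proof. We can actually prove both of these at once. Suppose that $f(x)$ is equal to $\sin x$ or $\cos x$.

Then, for all $n$, $f^{(n+1)}(x)$ is equal to $\pm\sin x$ or $\pm\cos x$. As $\sin x$ and $\cos x$ are always between $-1$ and $1$, in any case, for all $c$, $|f^{(n+1)}(c)| \le 1$.

Therefore, for each fixed value of $x$,

$$0 \le E_f^n(x) = \left|f^{(n+1)}(c)\right|\cdot\frac{|x|^{n+1}}{(n+1)!} \le \frac{|x|^{n+1}}{(n+1)!},$$

and, by Lemma 4.4.4, the last term on the right above approaches 0 as $n$ approaches $\infty$.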
As we pointed out when we gave the Maclaurin series for $\sin x$ and $\cos x$ back in Theorem 4.2.11, it is helpful to remember that $\sin x$ is an odd function, and its Maclaurin series therefore contains only odd-powered terms, and that $\cos x$ is even, and so its Maclaurin series contains only even-powered terms, including a constant $x^0$ term.


Remark 4.4.8. You may be wondering why we use the Maclaurin series for $e^x$, $\sin x$, and $\cos x$ in the theorems above, instead of using Taylor series centered somewhere other than at 0. Was it really important for us to center our Taylor series at 0? The answer is: it wasn't really important, but we can only remember/memorize so many things.

It was unimportant for us to use Taylor series centered at 0 because, in fact, if $f(x)$ equals $e^x$, $\sin x$, or $\cos x$, and $a$ is any center whatsoever, then the proofs that we gave when $a = 0$ would also show that $f(x)$ is equal, as a function, to its Taylor series function $T_f^\infty(x;a)$.

It's true that our Taylor error would now have a factor of $|x - a|^{n+1}/(n+1)!$ in place of $|x|^{n+1}/(n+1)!$, and that the $c$ in the $f^{(n+1)}(c)$ could now be between $a$ and $x$, instead of between 0 and $x$, but these things don't matter; it's still true that, for a given $x$ and $a$, $\lim_{n\to\infty} |x - a|^{n+1}/(n+1)! = 0$, and that we have fixed upper bounds on $|f^{(n+1)}(c)|$.

So, for instance, we conclude from our discussion and our earlier example on the Taylor series of $\sin x$, centered at $\pi/2$, that there is an equality of functions

$$\sin x = 1 - \frac{1}{2!}\left(x - \frac{\pi}{2}\right)^2 + \frac{1}{4!}\left(x - \frac{\pi}{2}\right)^4 - \frac{1}{6!}\left(x - \frac{\pi}{2}\right)^6 + \cdots.$$

Should you memorize this??? Absolutely not. Memorize the Maclaurin series; they are simpler, and they're what get used in applications 99% of the time. If you need other Taylor series, you calculate them on-the-fly.


Example 4.4.9. In fact, by manipulating known Maclaurin series, you can find new power series representations of functions. For instance, we know that, for all $x$,

$$\cos x = \sum_{k=0}^{\infty}(-1)^k\frac{x^{2k}}{(2k)!} = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \cdots. \tag{4.10}$$

But, if $x$ is a number, then $\frac{\pi}{2} - x$ is just some number, and we can replace every $x$ in Formula 4.10 with $\frac{\pi}{2} - x$. Thus, we conclude that, for all $x$,

$$\cos\left(\frac{\pi}{2} - x\right) = 1 - \frac{1}{2!}\left(\frac{\pi}{2} - x\right)^2 + \frac{1}{4!}\left(\frac{\pi}{2} - x\right)^4 - \frac{1}{6!}\left(\frac{\pi}{2} - x\right)^6 + \cdots.$$

Now, as there is an identity $\sin x = \cos\left(\frac{\pi}{2} - x\right)$, and as raising $\left(\frac{\pi}{2} - x\right)$ to an even power is the same as raising $\left(x - \frac{\pi}{2}\right)$ to that even power, we conclude that

$$\sin x = 1 - \frac{1}{2!}\left(x - \frac{\pi}{2}\right)^2 + \frac{1}{4!}\left(x - \frac{\pi}{2}\right)^4 - \frac{1}{6!}\left(x - \frac{\pi}{2}\right)^6 + \cdots, \tag{4.11}$$

which, of course, is exactly what we concluded in the previous remark. The point here is that the previous remark used our earlier calculation of the Taylor series of $\sin x$, centered at $\pi/2$, and considered bounds on the error; in this example, we simply messed with the Maclaurin series of $\cos x$ to obtain the result very quickly.
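
The equality of $\sin x$ with the series in even powers of $(x - \pi/2)$ can be spot-checked numerically. The Python sketch below sums the first several terms of that series and compares the result with `math.sin`; the function name `sin_series_at_half_pi` and the default of 12 terms are arbitrary choices of ours for illustration.

```python
import math

def sin_series_at_half_pi(x, terms=12):
    """Partial sum of sum_k (-1)^k (x - pi/2)^(2k) / (2k)!."""
    h = x - math.pi / 2
    return sum((-1) ** k * h ** (2 * k) / math.factorial(2 * k)
               for k in range(terms))

for x in (0.0, 1.0, 2.0, 3.0):
    print(x, sin_series_at_half_pi(x), math.sin(x))
```

For these sample points, the two columns should agree to many decimal places, even at $x = 0$, which is a distance of $\pi/2$ from the center.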


There is, however, a subtle point here. While our work in this example really does prove the equality in Formula 4.11, it requires a new result, which we won't get to until Corollary 4.6.8, to know that the equality in Formula 4.11 actually tells us that the given power series is, in fact, the Taylor series for $\sin x$, centered at $\pi/2$. Right now, we don't know that, if a function $f$ equals a power series centered at $a$ (on some open interval around $a$), then that power series has to be the Taylor series of $f$ centered at $a$. It's true; we just haven't proved it yet.


Theorem 4.4.10. (Geometric Series Theorem) Suppose that $a$ is a constant. Then, for all $x$ such that $|x| < 1$, the function $a/(1 - x)$ is equal to its Maclaurin series, i.e., there is an equality of functions

$$\frac{a}{1 - x} = \sum_{k=0}^{\infty} ax^k = a + ax + ax^2 + ax^3 + ax^4 + \cdots.$$

In addition, if $a \ne 0$, then the infinite series $\sum_{k=0}^{\infty} ax^k$ diverges if $|x| \ge 1$.

This series is called a (or sometimes the) geometric series.
Proof. Note that $a/(1 - x)$ exists as long as $x \ne 1$, but that we are saying that, if $a \ne 0$, the infinite sum does not exist if $x \ge 1$ or $x \le -1$. You may have guessed, correctly, that this means that our proof does not look like our proofs that $e^x$, $\sin x$, and $\cos x$ equal their Maclaurin series.

Let $f(x) = a/(1 - x)$. Then, it is easy to see (or prove by induction) that $f^{(k)}(x) = k!\,a\,(1 - x)^{-(k+1)}$. Hence, $f^{(k)}(0) = k!\,a$, and so the $n$-th order Maclaurin polynomial of $f$ is

$$T_f^n(x) = a + ax + ax^2 + ax^3 + ax^4 + \cdots + ax^n = \sum_{k=0}^{n} ax^k.$$

Recall Corollary 2.1.11, Item c. Replacing $b$ with $x$ in that formula, we have that, if $x \ne 1$, then

$$T_f^n(x) = \sum_{k=0}^{n} ax^k = a\cdot\frac{x^{n+1} - 1}{x - 1} = \frac{a - ax^{n+1}}{1 - x},$$

and so

$$E_f^n(x) = \left|f(x) - T_f^n(x)\right| = \left|\frac{a}{1 - x} - \frac{a - ax^{n+1}}{1 - x}\right| = |a|\cdot\frac{|x|^{n+1}}{|1 - x|}.$$

When $|x| < 1$, this last quantity approaches 0, as $n$ approaches $\infty$, by Lemma 4.4.4.

We leave the divergence claim as an exercise.
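
Unlike the Lagrange bounds used earlier, the error formula in this proof is exact, which makes it easy to test. The Python sketch below compares $|f(x) - T_f^n(x)|$ with $|a|\cdot|x|^{n+1}/|1 - x|$ for sample values, and also prints a few partial sums for $x = 2$ to illustrate the divergence claim. The sample values $a = 3$, $x = -0.5$ and the function names are our own choices.

```python
def geometric_partial_sum(a, x, n):
    """n-th order Maclaurin polynomial of a/(1 - x): sum of a*x^k, k = 0..n."""
    return sum(a * x**k for k in range(n + 1))

def exact_error(a, x, n):
    """Exact error |a| * |x|^(n+1) / |1 - x| from the proof (needs x != 1)."""
    return abs(a) * abs(x) ** (n + 1) / abs(1 - x)

a, x = 3.0, -0.5
for n in (2, 5, 10, 20):
    actual = abs(a / (1 - x) - geometric_partial_sum(a, x, n))
    print(n, actual, exact_error(a, x, n))   # the two columns should match

# For |x| >= 1 the partial sums do not settle down:
print([geometric_partial_sum(1, 2, n) for n in range(6)])
```

With $x = 2$ and $a = 1$, the partial sums are $1, 3, 7, 15, \ldots$, which grow without bound even though $a/(1 - x) = -1$ is perfectly well-defined there.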


As we mentioned earlier, it is possible for a Taylor series of f to converge and, yet, not 
converge to the value of f. 


Example 4.4.11. Consider the function given by 


-1/2? : ‘ 
oe f Y/e" if x £0; 


0, ifx=0. 


Clearly, f is differentiable for x 4 0; we claim that f is also differentiable at x = 0. To see 
this, consider 


f(h)—fO) _ . eR 0 
h-0 h 7 Pa h . 
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We consider each of the one-sided limits, make the substitution u = 1/h, and use l’H6pital’s 
Rule (see[2]). For instance, 


2 2 
ee eae <a 

lim —— = lim = lm —; = lim —; = 
h—0+ h uU—oo 1/u u—oo eu u—oo 2ueru 


The limit as h — O7 is just as easily shown to be 0. 


More generally, it is not difficult to show that, for $x \neq 0$, the k-th derivative of f, $f^{(k)}(x)$,
is of the form $e^{-1/x^2}\cdot r(x)$, where r(x) is a rational function, i.e., a quotient of polynomials.
Substitution and l’Hôpital’s Rule (see [2]) can then be used to show that f(x) is infinitely
differentiable everywhere, including at x = 0, and that f and all of its derivatives are 0 at x = 0.
Hence, the Maclaurin series for f has every term equal to 0, and so, converges to 0 at all x
values.


Figure 4.2: The graph of f is very flat near « = 0. 


Therefore, the function $T_f^\infty(x)$ is the zero function, the function that is always 0, while f(x)
certainly is not.

From a graphical point of view, what we see is that the graph of f is extremely flat near
x = 0, in other words, extremely close to being y = 0.
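To get a feel for how flat f is near 0, one can simply evaluate it. This is a small illustrative sketch; the sample points are arbitrary choices, not taken from the text.

```python
import math


def f(x):
    """The function of Example 4.4.11: exp(-1/x**2) for x != 0, and 0 at x = 0."""
    return math.exp(-1.0 / x**2) if x != 0 else 0.0


# The Maclaurin series of f is identically 0, yet f(x) > 0 for every x != 0.
for x in (1.0, 0.5, 0.2, 0.1):
    print(x, f(x))
```

The printed values are positive but collapse toward 0 extremely quickly as x shrinks, which is the numerical counterpart of the flat graph in Figure 4.2.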


You can also tinker with the function f above to produce functions with other strange 


properties. For instance, consider 


Then, our discussion about f(x) allows us to quickly see that g(x) is also infinitely differentiable
everywhere, including at x = 0, is zero for x < 0, and suddenly is non-zero for x > 0. As we
shall see in Theorem 4.5.26, this sort of “weird” activity cannot happen for a function which
equals a power series.


In the next theorem, we state a result about $\ln(1+x)$ equaling its Maclaurin series, for
$-1 < x \leq 1$. In fact, we shall give, in this section, the proof only for $0 \leq x \leq 1$; the rest of the
proof will be much easier after we have some results from the next section.


Theorem 4.4.12. For $-1 < x \leq 1$, the function $\ln(1+x)$ is equal to its Maclaurin series,
i.e., there is an equality of functions

$$\ln(1+x) = x - \frac{x^2}{2} + \frac{x^3}{3} - \frac{x^4}{4} + \cdots = \sum_{k=1}^{\infty}\frac{(-1)^{k-1}x^k}{k}.$$


Proof. We give the proof for $0 \leq x \leq 1$; the remainder of the proof is in Example 4.6.15.
If x = 0, then we know that the convergence and equality hold. So, suppose that $0 < x \leq 1$.


If you haven’t done so already, you should do the exercise from Section 4.2 to show that the
Maclaurin series for $f(x) = \ln(1+x)$ is the given series. The key steps are, for $k \geq 1$, to recall
that $k! = k\cdot((k-1)!)$ and to check that

$$f^{(k)}(x) = (-1)^{k-1}(k-1)!\,(1+x)^{-k}.$$

Therefore, for $n \geq 1$, there exists a c, where $0 < c < x \leq 1$, such that

$$|E_f^n(x)| = \left|\frac{f^{(n+1)}(c)}{(n+1)!}\,x^{n+1}\right| = \frac{n!}{(n+1)!}\left(\frac{x}{1+c}\right)^{n+1} = \frac{1}{n+1}\left(\frac{x}{1+c}\right)^{n+1} \leq \frac{x^{n+1}}{n+1},$$

which, since $0 < x \leq 1$, approaches 0 as n approaches $\infty$. Thus, once again, $|E_f^n(x)|$ gets pinched
and must approach 0.
Remark 4.4.13. We have one cool remark about Theorem 4.4.12 and one warning.


Let’s start with the warning. It is easy to get used to seeing factorials in Taylor and Maclaurin
series and to mistakenly put factorials on the denominators of the Maclaurin series for $\ln(1+x)$.
Don’t do this! Make a special mental note: no factorials appear in the Maclaurin series for
$\ln(1+x)$, at least, not after it’s simplified.


The cool remark is that, when x = 1, we obtain that

$$\ln 2 = 1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \frac{1}{5} - \cdots.$$

The infinite sum/difference on the right is known as the alternating harmonic series, in contrast
with the harmonic series, which has all plus signs. As we shall see in Proposition 5.2.16, the
harmonic series diverges to $\infty$.


A weird thing happens when a sum/series converges, but diverges when you add up the
absolute values of all of the summands: it is possible to rearrange the terms (including the
negative signs) that you’re adding and change the sum. By taking lots of positive terms earlier,
you can make the sum diverge to $\infty$. By taking lots of negative terms earlier, you can make
the sum diverge to $-\infty$. And, by rearranging in the right ways, you can make the infinite sum
converge to any real number that you want. Thus, the order in which you add the terms changes
the value of the summation, something which is not the case for finite sums.


We shall discuss such conditionally convergent series more carefully in Section 5.4.
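The rearrangement phenomenon can be seen experimentally. The sketch below uses one simple greedy strategy (an assumption chosen for illustration, not a method given in the text): take unused positive terms 1, 1/3, 1/5, ... while the running total is at or below the target, and unused negative terms -1/2, -1/4, ... while it is above.

```python
def rearranged_partial_sum(target, num_terms):
    """Add num_terms terms of the alternating harmonic series in a greedy order
    chosen so that the running total is steered toward `target`."""
    total = 0.0
    next_odd, next_even = 1, 2  # denominators of the next unused positive/negative term
    for _ in range(num_terms):
        if total <= target:
            total += 1.0 / next_odd   # use the next positive term
            next_odd += 2
        else:
            total -= 1.0 / next_even  # use the next negative term
            next_even += 2
    return total


# Hypothetical targets: the same terms, reordered, approach different values.
for target in (1.5, 0.0, -1.0):
    print(target, rearranged_partial_sum(target, 200_000))
```

Every term of the series is eventually used exactly once, so each run really is a rearrangement of the same series; only the order changes.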


The fact that the next function, $(1+x)^p$, equals its Maclaurin series was discovered by Sir
Isaac Newton. The Maclaurin series which appears is called the binomial series, and the fact
that it equals its Maclaurin series, for the given values of x, is called the Binomial Theorem.


A direct proof of the Binomial Theorem, using Lagrange’s form of the remainder, is problematic,
and so we outline and have you produce an alternative proof in Exercise 43 in Section 4.6.
We include the statement in this section, since it is one of the fundamental Maclaurin series
that is worth memorizing.
Theorem 4.4.14. (Binomial Theorem) Let p be a real number. Then, for $-1 < x < 1$,
the function $(1+x)^p$ is equal to its Maclaurin series, i.e., there is an equality of functions

$$(1+x)^p = 1 + px + \frac{p(p-1)}{2!}x^2 + \frac{p(p-1)(p-2)}{3!}x^3 + \frac{p(p-1)(p-2)(p-3)}{4!}x^4 + \cdots.$$

If we let $\binom{p}{0} = 1$ and, for $k \geq 1$, let $\binom{p}{k} = p(p-1)(p-2)\cdots(p-k+1)/(k!)$, then the
Binomial Theorem tells us that, for $|x| < 1$,

$$(1+x)^p = \sum_{k=0}^{\infty}\binom{p}{k}x^k.$$


Remark 4.4.15. The expression $\binom{p}{k}$ is read as “p choose k”. When p is a positive integer $\geq k$,
p choose k is used often in probability and combinatorics; it represents the number of possible
(unordered) ways of choosing k objects from among p objects.


When p is a positive integer, all of the summands in the Binomial Series for $(1+x)^p$ will
be zero after the $x^p$ term for, after that, the factor $(p-p)$ will appear in the numerator of the
coefficient.


For instance, if p = 3, Theorem 4.4.14 yields the familiar binomial expansion

$$(1+x)^3 = 1 + 3x + \frac{3(3-1)}{2}x^2 + \frac{3(3-1)(3-2)}{6}x^3 + \frac{3(3-1)(3-2)(3-3)}{24}x^4 + \frac{3(3-1)(3-2)(3-3)(3-4)}{120}x^5 + \cdots$$

$$= 1 + 3x + 3x^2 + x^3.$$
The value of the Binomial Theorem is that it holds when p is negative, a rational number,
or even an irrational number.


Example 4.4.16. Use the Binomial Theorem to approximate $\sqrt{3.8}$.


Solution:


The instruction “to approximate” is pretty vague. We’ll give one easy approximation.


To use the Binomial Theorem to get a “good” approximation, we need to apply it to something
of the form $(1+x)^{1/2}$, where x is “close to” the center of the Binomial Series, i.e., where
x is close to 0. But $\sqrt{3.8} = (1+2.8)^{1/2}$, and 2.8 doesn’t seem very close to 0. So, what do we
do?


Well...we think harder. We think:

$$\sqrt{3.8} = \sqrt{4 - 0.2} = \sqrt{4}\cdot\sqrt{1 - \frac{0.2}{4}} = 2\cdot(1-0.05)^{1/2}.$$

Now, we can apply the Binomial Theorem with x = −0.05, which is substantially closer to 0
than 2.8 was.

What is the easiest non-trivial approximation that we can obtain for $(1-0.05)^{1/2}$ from the
Binomial Series? The one we get from the linear approximation, that

$$(1+x)^p \approx 1 + px,$$

when x is close to 0. Thus, $(1-0.05)^{1/2} \approx 1 - 0.025 = 0.975$, and so

$$\sqrt{3.8} = 2\cdot(1-0.05)^{1/2} \approx 2\cdot 0.975 = 1.950.$$

As you can check on your calculator, $\sqrt{3.8}$ “=” 1.949358868961793. So, our quick
approximation is pretty good.
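The same computation can be carried further by keeping more terms of the binomial series. The following sketch is illustrative only; the helper names `binomial_coefficient` and `binomial_partial_sum` are assumptions introduced here, and the coefficient is computed directly from the product formula in Theorem 4.4.14.

```python
import math


def binomial_coefficient(p, k):
    """Generalized 'p choose k': p(p-1)...(p-k+1)/k!, equal to 1 when k = 0."""
    result = 1.0
    for j in range(k):
        result *= (p - j) / (j + 1)
    return result


def binomial_partial_sum(p, x, n):
    """Sum of the binomial series for (1 + x)**p through the x**n term."""
    return sum(binomial_coefficient(p, k) * x**k for k in range(n + 1))


# sqrt(3.8) = 2 * (1 - 0.05)**(1/2); n = 1 is the linear approximation from the example.
for n in (1, 2, 3, 5):
    print(n, 2 * binomial_partial_sum(0.5, -0.05, n), math.sqrt(3.8))
```

With n = 1 this reproduces the estimate 1.950 from the example; larger n should move the estimate closer to the calculator value.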
4.4.1 Exercises


1. Recall that

$$\ln 2 = 1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \frac{1}{5} - \cdots.$$

Approximate ln 2 by using the partial sum containing the first 4 terms in the infinite series.
Compare this with the calculator value of 0.693147180560 for ln 2. Would you say that
your approximation is “good”? Does the approximation get better if you use the first 10
terms?


2. a. Using that

$$e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \cdots,$$

we see that $6\left(e^x - 1 - x - \frac{x^2}{2}\right)$ is best approximated by what power of x when x is
close to 0?


b. Use your approximation from part (a) to estimate the value of

$$v = 6\left(e^{0.1} - 1 - 0.1 - \frac{(0.1)^2}{2}\right).$$

Compare your estimate with the value of v from your calculator.


3. a. Using that

$$\sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \cdots,$$

we see that $x - \sin x$ is best approximated by what function of the form $cx^n$ when x
is close to 0?


b. Use your approximation from part (a) to estimate the value of v = 0.1 − sin(0.1).
Compare your estimate with the value of v from your calculator. (Make certain that
your calculator is set to use radians in trig functions.)


4. a. Using that

$$\cos x = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \cdots,$$

we see that $2(1 - \cos x)$ is best approximated by what power of x when x is close to
0?

b. Use your approximation from part (a) to estimate the value of v = 2(1 − cos(0.01)).
Compare your estimate with the value of v from your calculator. (Make certain that
your calculator is set to use radians in trig functions.)


5. a. Using that

$$\ln(1+x) = x - \frac{x^2}{2} + \frac{x^3}{3} - \frac{x^4}{4} + \frac{x^5}{5} - \frac{x^6}{6} + \cdots,$$

we see that $\dfrac{x - \ln(1+x)}{x^2}$ is best approximated by what constant when x is close
to 0?


b. Without using l’Hopital’s Rule, calculate 


im 2 +In(1 +2) 
xz—0 2 
6. a. We know that, for $|x| < 1$,

$$\frac{1}{1-x} = 1 + x + x^2 + x^3 + x^4 + \cdots.$$

So, when x is close to zero, $1/(1-x)$ is best approximated by what quadratic
polynomial?


b. Use your approximation in part (a) to estimate $\dfrac{1}{1.01}$. Compare this with the
calculator value of 1/1.01.


7. The Binomial Theorem, Theorem 4.4.14, with $p = \frac{1}{3}$, tells us that

$$(1+x)^{1/3} = 1 + \frac{1}{3}x + \frac{\left(\frac{1}{3}\right)\left(-\frac{2}{3}\right)}{2!}x^2 + \cdots.$$

Use the first three terms of this series to estimate $(1.09)^{1/3}$. Compare this with the
calculator value of $(1.09)^{1/3}$.
8. Use the first two terms of the binomial series to approximate 31, and compare this with
the value from your calculator.


9. Explain how the Binomial Theorem, Theorem 4.4.14, yields the Geometric Series Theorem,
Theorem 4.4.10, in the case where a = 1.


10. a. Find the Taylor series for $e^x$, centered at 5. Either use summation notation, or write
out the first 5 non-zero terms, followed by dots.

b. Determine the x values for which your Taylor series in part (a) equals the function
$e^x$.


11. a. Find the Taylor series for $\cos x$, centered at $\pi/4$. Either use summation notation, or
write out the first 5 non-zero terms, followed by dots.

b. Determine the x values for which your Taylor series in part (a) equals the function
$\cos x$.


12. a. Find the Taylor series for $\sin x$, centered at $\pi/4$. Either use summation notation, or
write out the first 5 non-zero terms, followed by dots.

b. Determine the x values for which your Taylor series in part (a) equals the function
$\sin x$.


13. a. Find the Taylor series for $\ln(1+x)$, centered at 2. Either use summation notation,
or write out the first 5 non-zero terms, followed by dots. Hint: You may wish to use
the formula for the k-th derivative from the proof of Theorem 4.4.12.

b. Can you determine any x values, other than 2, for which your Taylor series in part
(a) equals the function $\ln(1+x)$?


14. a. Find the Taylor series for $1/(1-x)$, centered at 2. Either use summation notation,
or write out the first 5 non-zero terms, followed by dots.

b. Can you determine any x values, other than 2, for which your Taylor series in part
(a) equals the function $1/(1-x)$?


15. Let $f(x) = xe^x$.

a. Show that the k-th derivative of f, $f^{(k)}(x)$, is equal to $(x+k)e^x$.

b. Find the Maclaurin series for f(x), using the definition of Maclaurin series.

c. Why is your answer to part (b) unsurprising?

d. Show that f(x) is equal to its Maclaurin series for all x.


16. Suppose that f(x) is infinitely differentiable at 0. Then, $g(x) = xf(x)$ is also infinitely
differentiable at 0.

a. Show that, for $k \geq 1$, $g^{(k)}(x) = xf^{(k)}(x) + kf^{(k-1)}(x)$.

b. Show that, if the Maclaurin series for f(x) is $\sum_{k=0}^{\infty} c_k x^k$, then the Maclaurin series for
g(x) is $\sum_{k=0}^{\infty} c_k x^{k+1}$.

c. Suppose that f(x) equals its Maclaurin series for all x in some set I. Show that g(x)
equals its Maclaurin series for all x in I. (Hint: Do not use the Lagrange form of the
remainder; use the definition of the Taylor remainder and the properties of limits.)
4.5 Power Series as Functions I: Definitions & the Ratio Test


In the previous section, we began with a function f(x) and discussed when it was equal to its
Taylor or Maclaurin series or, as people frequently say, when a given function can be represented
by its Taylor series. We saw that $f(x) = T_f^\infty(x; a)$ if and only if the error $E_f^n(x; a)$ approaches
0 as $n \to \infty$.


Showing that the error approaches 0 actually implies two essentially separate things at once;
it implies that $T_f^\infty(x; a)$ converges to something, and that the something that it converges to
is precisely f(x).

But, what if you start with just a power series $\sum_{k=0}^{\infty} c_k(x-a)^k$, instead of some pre-defined
function f(x)? Can we say when the power series, as a formal algebraic object, can be used to
define a function? And, even if we can use a power series to define a function, is there any
point to doing so?


The answer to both of these questions is: YES. Power series define functions wherever they 
converge and, to a large extent, as far as algebraic operations, differentiation, and integration 
are concerned, functions which are defined by power series can be treated just like polynomials, 


polynomials which never end. 


In this section, we will begin our general investigation of using power series to define functions; 
though, actually, we introduced you to this idea in [2], when we defined the exponential function 


exp(x) using limits of polynomials. 


We shall give fundamental definitions and results on convergence, and the domain of a 
power series function. In particular, we will state and use the Ratio Test to find the radius of 
convergence, a fundamental notion related to the domains of power series functions. We shall 
also define and discuss real analytic functions; these are functions which are locally equal to 


power series, but the center of the power series is allowed to move. 


We will not prove the Ratio Test, or even state other convergence tests, until Chapter 5.
However, before going into such an extensive discussion of convergence, we will, in the next
section, Section 4.6, look at manipulations of convergent power series, in order to show why it’s
so nice to have functions defined, or represented by, power series.


We discussed partial sums and convergence of Taylor series earlier (recall that the n-th order
partial sum of a Taylor series is just the n-th order Taylor polynomial). These are special cases
of the general notions of partial sums, infinite series, and convergence of series, which we will
look at in detail in Chapter 5. However, we need to briefly define and discuss these terms here,
so that we can look at them in the specific case of power series.


We defined sequences and their convergence back in [2], and have referred to them a number 
of times throughout this textbook; they will now be so crucial to us that we want to give the 


definition again. 


Definition 4.5.1. Suppose that m is an integer, i.e., is in the set $\mathbb{Z}$. Denote by $\mathbb{Z}_{\geq m}$ the
set of integers which are greater than or equal to m.

A function $b : \mathbb{Z}_{\geq m} \to \mathbb{R}$ is called a sequence (of real numbers). In place of b(n), it
is standard to write $b_n$.


We say that the sequence $b_n$ converges to (a real number) L, and write
$\lim_{n\to\infty} b_n = L$ if and only if, for all $\epsilon > 0$, there exists an integer $N \geq m$ such that,
for all integers $n \geq N$, $|b_n - L| < \epsilon$.

If a sequence does not converge to some L, then we say that the sequence diverges. 


When discussing the limit of a sequence, the initial value, m above, of the index of the 
sequence, n above, is frequently omitted, since all you typically care about is what happens 


when n is big. 


Example 4.5.2. Consider the sequence $b_n = 2n+1$, for $n \geq 0$. This is the sequence of odd
natural numbers:

$$b_0 = 1,\ b_1 = 3,\ b_2 = 5,\ b_3 = 7,\ \ldots$$

This sequence clearly diverges to $\infty$.


The sequence $a_n = (-1)^n$, for $n \geq 0$, consists of alternating 1’s and −1’s:

$$a_0 = 1,\ a_1 = -1,\ a_2 = 1,\ a_3 = -1,\ \ldots,$$

and, therefore, diverges, since there is not one limit L that it gets arbitrarily close to.


On the other hand, the sequence 
converges to 0, while the sequence


converges to 1. 


It will be convenient to use k now, instead of n, for the indexing variable in our sequence;
of course, this doesn’t affect what the sequence is; the sequence $b_n = 2n+1$, for $n \geq 0$, is the
same as the sequence $b_k = 2k+1$, for $k \geq 0$.


Given a sequence of real numbers, $b_k$, where $k \geq m$, we are interested in defining the infinite
sum $\sum_{k=m}^{\infty} b_k$. The infinite summation is usually referred to as an infinite series.


Definition 4.5.3. Given a sequence $b_k$, for $k \geq m$, we define, for each $n \geq m$, the partial
sum to be $\sum_{k=m}^{n} b_k$, and the infinite sum or infinite series or, simply, series to be the
infinite summation $\sum_{k=m}^{\infty} b_k$, which, technically, consists of the sequence $b_k$, together with
the summation instruction/symbol, telling you to add the sequence.


The sum of the series or value of the series $\sum_{k=m}^{\infty} b_k$ is the limit as $n \to \infty$ of the
partial sums, and we write

$$\sum_{k=m}^{\infty} b_k = \lim_{n\to\infty}\sum_{k=m}^{n} b_k,$$

provided the limit exists, in which case we say that the partial sums converge, that the series
converges, or simply that the infinite sum exists and is equal to the limit of the partial sums.

If the limit of the partial sums does not exist, then we say that the series diverges or,
simply, that the infinite sum does not exist.


Note that there are actually two sequences associated with an infinite sum/series: the
sequence of things you’re adding, the terms, and the sequence of partial sums $s_n = \sum_{k=m}^{n} b_k$.
What we’re interested in is what happens to the sequence of partial sums as $n \to \infty$.


There should be no confusion here. We are trying to define what an infinite sum should 
mean. Intuitively, it means the limit as you add more and more terms; this means that an 


infinite sum is the limit of the partial sums, NOT the limit of the terms. The terms are 
just the individual summands; they are not sums themselves. 
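The distinction between the terms and the partial sums is easy to see numerically. Here is a minimal sketch using the geometric series with terms $1/2^k$, $k \geq 1$ (an example chosen for illustration; its sum is 1 by the Geometric Series Theorem with a = 1/2 and x = 1/2). The helper name `partial_sums` is an assumption.

```python
from itertools import accumulate


def partial_sums(terms):
    """Return the running sums s_m, s_{m+1}, ... of the given list of terms."""
    return list(accumulate(terms))


# Terms 1/2**k for k = 1, ..., 30.
terms = [1 / 2**k for k in range(1, 31)]
sums = partial_sums(terms)

# The terms shrink toward 0, but the value of the series is the limit of the
# partial sums, which here approach 1.
print("last term:       ", terms[-1])
print("last partial sum:", sums[-1])
```

The last term is nearly 0 while the last partial sum is nearly 1: the sum of the series is read off from the second sequence, not the first.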


What does any of this have to do with power series? Consider a power series, centered at a,
$p(x) = \sum_{k=0}^{\infty} c_k(x-a)^k$. This means that we have specified the infinite sequence of coefficients
$c_0, c_1, c_2, \ldots$.


Now, when we plug in a specific x value, $x = x_0$, the power series gives a series of real
numbers; letting $b_k = c_k(x_0-a)^k$, we have the series

$$\sum_{k=0}^{\infty} b_k = \sum_{k=0}^{\infty} c_k(x_0-a)^k,$$

and we just defined such an infinite sum in Definition 4.5.3. It’s defined to be the limit of the
partial sums

$$\lim_{n\to\infty}\sum_{k=0}^{n} c_k(x_0-a)^k,$$

provided that this limit exists.


Thus, as with polynomials and Taylor series, we define the partial sums of a power series. 


Definition 4.5.4. For all integers $n \geq 0$, we define the n-th order partial sum of
$p(x) = \sum_{k=0}^{\infty} c_k(x-a)^k$ to be the polynomial, or polynomial function,

$$p^n(x) = c_0 + c_1(x-a) + c_2(x-a)^2 + c_3(x-a)^3 + \cdots + c_n(x-a)^n.$$


And now we can define a power series function. This is just a mild generalization of our
earlier definition of a function defined by a Taylor series; however, now, we allow
arbitrary power series, instead of restricting ourselves to Taylor series of given functions.


Definition 4.5.5. Suppose we have the power series, centered at a, $p(x) = \sum_{k=0}^{\infty} c_k(x-a)^k$.

Then, the value of p(x) at a point $x_0$ is $p(x_0) = \lim_{n\to\infty} p^n(x_0) =$

$$\lim_{n\to\infty}\left[c_0 + c_1(x_0-a) + c_2(x_0-a)^2 + c_3(x_0-a)^3 + \cdots + c_n(x_0-a)^n\right],$$

provided that this limit exists; in this case, we say that the power series p(x) converges at
$x_0$.


If the limit fails to exist, we say that p(x) diverges at $x_0$.
The power series function p(x) is the function whose domain is the set of $x_0$ at which p(x)
converges, and its value at any $x_0$ in its domain is $p(x_0)$, as defined above. The codomain
of a power series function is taken to be all real numbers. The notation for this function is
the same as that of the algebraic power series, p(x), and we frequently use the term “power
series” to mean either the formal algebraic series or the function, letting the context make
the distinction clear.


Remark 4.5.6. It is trivial, but important, that every power series, centered at a, converges
at x = a; for there, the value of every term, other than the constant term, is zero, and so every
partial sum is simply the constant term $c_0$. Thus,

$$p(a) = \lim_{n\to\infty} p^n(a) = \lim_{n\to\infty} c_0 = c_0.$$

The real question is: where else, besides the center, does the power series converge?
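As a rough numerical companion to Definitions 4.5.4 and 4.5.5, the sketch below evaluates the n-th order partial sum of a power series at a point. The function name and its signature are illustrative assumptions; the coefficients are supplied as a function of k.

```python
def power_series_partial_sum(coefficient, a, x0, n):
    """Evaluate the n-th order partial sum sum_{k=0}^{n} c_k (x0 - a)**k,
    where c_k = coefficient(k)."""
    total = 0.0
    power = 1.0  # holds (x0 - a)**k, starting with k = 0
    for k in range(n + 1):
        total += coefficient(k) * power
        power *= (x0 - a)
    return total


def ones(k):
    """Coefficients c_k = 1 for all k: the geometric series with a = 1, centered at 0."""
    return 1.0


# Inside (-1, 1) the partial sums settle down; at x0 = 2 they keep growing.
for n in (5, 10, 20, 40):
    print(n, power_series_partial_sum(ones, 0.0, 0.5, n),
          power_series_partial_sum(ones, 0.0, 2.0, n))

# At the center, every partial sum is just the constant term c_0.
print(power_series_partial_sum(lambda k: 3.0 + k, 5.0, 5.0, 10))
```

Watching how the partial sums behave as n grows is only suggestive, of course; it cannot prove convergence or divergence.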


Example 4.5.7. Recall from Section 4.4 that the Maclaurin series for $e^x$, $\sin x$, and $\cos x$ are,
respectively,

$$\sum_{k=0}^{\infty}\frac{x^k}{k!},\qquad \sum_{k=0}^{\infty}\frac{(-1)^k x^{2k+1}}{(2k+1)!},\qquad\text{and}\qquad \sum_{k=0}^{\infty}\frac{(-1)^k x^{2k}}{(2k)!}.$$

In fact, in Section 4.4, we did not consider the separate question of which x values make these
series converge; we looked only at the question of which x values make the series converge and
make what they converge to equal to the value of the original function.


However, the three power series above converge and equal the original function for all x and
so, in particular, the domain of each of the three power series functions above is the entire real
line $(-\infty, \infty)$.


In Section 4.4, we also saw that the domain of the Maclaurin series (function) of $a/(1-x)$, for
$a \neq 0$, is precisely (−1, 1), for, in Theorem 4.4.10, we included the statement that the Maclaurin
series diverges outside of the interval (−1, 1).


On the other hand, you have to be somewhat careful when looking back at Section 4.4 for
the domains of the Maclaurin series of $\ln(1+x)$ and $(1+x)^p$. In Section 4.4, we gave intervals
on which the Maclaurin series of these two functions are equal to the functions themselves, but
there was no claim that the Maclaurin series diverged outside of those intervals; in theory, the
sets of points on which the Maclaurin series converge could be larger than the intervals on which
the Maclaurin series agree with the original functions.


As we shall see later, the domain of the Maclaurin series function of $\ln(1+x)$ is precisely
the interval on which the series converges to $\ln(1+x)$, namely, the interval (−1, 1].


However, the domain of the Maclaurin series function of $(1+x)^p$, the binomial series, depends
on p, and can be any one of three possible intervals: (−1, 1), (−1, 1], or [−1, 1]. While it is beyond
the scope of this textbook, it can be shown that the binomial series for $(1+x)^p$ converges at
x = −1 if and only if $p \geq 0$, and converges at x = 1 if and only if $p > -1$. Thus, for instance,
when p = −1/2, the interval on which the Maclaurin series of $1/\sqrt{1+x}$ converges is (−1, 1].


Of course, we haven’t given an example of a power series that’s not the Maclaurin series of 


some “nice” function. What can we say about the domain of the power series function 


Below, we give a general result that tells us that the domain of this, or any, power series 
function must be an interval, and, in fact, that interval must have, as its center, the center of 
the power series, which is 5 for the series above. We will then state the Ratio Test, which tells 
you, in many cases, how to find the radius (half the length) of the interval on which the series 


converges. 


Recall now that an extended real number is a real number or $\pm\infty$. We need one more
definition before we can begin with the results and examples.


Definition 4.5.8. Suppose that $R \geq 0$ is an extended real number, i.e., a non-negative real
number or $\infty$.


Then, an interval centered at a is an interval which contains a and which has endpoints
$a - R$ and $a + R$, where the interval may or may not include one or both of the
endpoints.


Note that, if $R = \infty$, then the interval is $(-\infty, \infty)$, and if R = 0, then the interval
(which was required to contain a) must be the closed interval which contains only a, i.e.,
must be [a, a].


The extended real number R is called the radius of the interval.
As we mentioned, back in Example 4.5.7, there is the following result.


Theorem 4.5.9. The set of points at which a power series, $\sum_{k=0}^{\infty} c_k(x-a)^k$, centered at
a, converges is an interval centered at a.


In addition, at points x in the interior of this interval (if there are any), the series
$\sum_{k=0}^{\infty} |c_k|\cdot|x-a|^k$, whose terms are the absolute values of the original series, converges.


Proof. This result is beyond the scope of this textbook. See Theorem 4.5.2 of [4].


In light of this, we make another definition. 


Definition 4.5.10. The interval of points at which a power series converges, that is, the
domain of the power series function, is called the interval of convergence of the power
series, and the radius of that interval is called the radius of convergence of the power
series.


Thus, for a power series $\sum_{k=0}^{\infty} c_k(x-a)^k$ centered at a, the radius of convergence is the
unique extended real number $R \geq 0$ such that, if $|x-a| < R$, then the series converges, and
if $|x-a| > R$, then the series diverges.


Remark 4.5.11. The second paragraph of Theorem 4.5.9 means that, when R > 0, at points
where $|x-a| < R$, i.e., at points x in the open interval $(a-R, a+R)$ (meaning the open
interval $(-\infty, \infty)$, when $R = \infty$), the power series converges in a fairly strong way; it converges
absolutely, which we won’t discuss any further until Section 5.4.


Example 4.5.12. In Example 4.5.7, using our current terminology, what we saw is that the
intervals of convergence of the Maclaurin series for $e^x$, $\sin x$, and $\cos x$ are all $(-\infty, \infty)$ and,
hence, the radii of convergence for all of those series are $\infty$.


We also discussed in Example 4.5.7 that, for $a \neq 0$, the Maclaurin series for $a/(1-x)$, i.e., the
geometric series, has (−1, 1) as its interval of convergence, and so, has radius of convergence 1.
Furthermore, the Maclaurin series for $\ln(1+x)$ and $(1+x)^p$ also have radii of convergence equal
to 1, with corresponding intervals of convergence (−1, 1] and, depending on p, one of (−1, 1),
(−1, 1], and [−1, 1].
However, we have yet to explain any method whatsoever for determining the radius or interval
of convergence of essentially arbitrary series, like the one at the end of Example 4.5.7. For this, we
use the Ratio Test, below.


There are many theorems related to the convergence of power series; by far the most important
of these is the Ratio Test. We shall discuss the Ratio Test in this section, and defer the
other convergence tests until Chapter 5. The Ratio Test is a theorem about series of constants,
but one which, when applied to many power series, yields the radii of convergence. However, the
Ratio Test, in its usual form, can never tell you about convergence/divergence at the endpoints
of the interval of convergence.


As we shall see in Theorem 5.3.23, the proof of the Ratio Test just amounts to comparing
with what happens for geometric series (recall Theorem 4.4.10).


Theorem 4.5.13. (The Ratio Test) Consider the series $\sum_{k=m}^{\infty} b_k$.


1. If there exists r < 1 (and, necessarily, $\geq 0$) and an integer $M \geq m$ such that, for all
$k \geq M$,

$$b_k \neq 0 \quad\text{and}\quad \left|\frac{b_{k+1}}{b_k}\right| \leq r,$$

then the given series converges.


2. If there exists an integer $M \geq m$ such that, for all $k \geq M$,

$$b_k \neq 0 \quad\text{and}\quad \left|\frac{b_{k+1}}{b_k}\right| \geq 1,$$

then the given series diverges.


In particular, suppose that $\lim_{k\to\infty}\left|\dfrac{b_{k+1}}{b_k}\right|$ exists, as an extended real number; call its value L.
Then,

a. if L < 1, the given series converges;

b. if L > 1, including $L = \infty$, the given series diverges;

c. if L = 1, the given series may converge or diverge.
Remark 4.5.14. You usually try to calculate the limit L in the Ratio Test first, and hope that
it exists and is unequal to 1. If L fails to exist or equals 1, then you consider cases 1) and 2).


Be aware, however, that, if L exists and L = 1, then there cannot possibly exist an r as in
case 1) of the Ratio Test, and so you will not be able to use the Ratio Test to conclude that the
series converges. On the other hand, even if L = 1, you may be able to use case 2) to conclude
that the series diverges.


Also, if the limit L fails to exist, cases 1) and 2) may enable you to conclude convergence or
divergence of a given infinite series.


Remark 4.5.15. Due to the nature of the Ratio Test and because factorials frequently appear
in series, it is worth recalling how factorials work, before we look at examples.


From the definition, for $k \geq 0$,

$$(k+1)! = (k+1)k(k-1)(k-2)\cdots(3)(2)(1)$$

and

$$(2k+2)! = (2k+2)(2k+1)(2k)(2k-1)(2k-2)\cdots(3)(2)(1).$$

Hence,

$$(k+1)! = (k+1)(k!) \quad\text{and}\quad [2(k+1)]! = (2k+2)(2k+1)[(2k)!],$$

so that

$$\frac{(k+1)!}{k!} = k+1 \quad\text{and}\quad \frac{[2(k+1)]!}{(2k)!} = (2k+2)(2k+1).$$

The point of this is not that you should memorize these specific formulas for quotients involving
factorials, but rather that, if you just think about what factorial means, such formulas become
quick and easy to derive.


Example 4.5.16. Let’s consider six different series of constants, series whose terms, for $k \geq 1$,
are given by
What does the Ratio Test say about the six corresponding infinite series/summations?


• $a_k = 1/3^k$:

Since

$$\lim_{k\to\infty}\left|\frac{a_{k+1}}{a_k}\right| = \lim_{k\to\infty}\frac{1/3^{k+1}}{1/3^k} = \lim_{k\to\infty}\frac{3^k}{3^{k+1}} = \frac{1}{3} < 1,$$

the Ratio Test tells us that

$$\sum_{k=1}^{\infty} a_k = \sum_{k=1}^{\infty}\frac{1}{3^k} = \frac{1}{3} + \frac{1}{3^2} + \frac{1}{3^3} + \frac{1}{3^4} + \cdots$$

converges.


In fact, $\sum_{k=1}^{\infty}\frac{1}{3^k}$ is equal to $\sum_{k=0}^{\infty}\frac{1}{3}\cdot\left(\frac{1}{3}\right)^k$ (if this isn’t clear, write out the first few terms),
and so this series is a geometric series; hence, we can apply Theorem 4.4.10, with a = 1/3 and
x = 1/3, to conclude that, not only does $\sum_{k=1}^{\infty} a_k$ converge, but it converges to $\frac{1/3}{1-1/3} = 1/2$.


• $b_k = 3^k$:

Since

$$\lim_{k\to\infty}\left|\frac{b_{k+1}}{b_k}\right| = \lim_{k\to\infty}\frac{3^{k+1}}{3^k} = 3 > 1,$$

the Ratio Test tells us that

$$\sum_{k=1}^{\infty} b_k = \sum_{k=1}^{\infty} 3^k = 3 + 3^2 + 3^3 + 3^4 + \cdots$$

diverges. Of course, it should be obvious that the partial sums are approaching $\infty$.


As with $\sum a_k$, the current series is geometric, but, this time, with a = 3 and x = 3, and
so Theorem 4.4.10 already told us that the series $\sum_{k=1}^{\infty} b_k$ diverges.


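• $c_k = 3^k/k!$:

Since

$$\lim_{k\to\infty}\left|\frac{c_{k+1}}{c_k}\right| = \lim_{k\to\infty}\frac{3^{k+1}/(k+1)!}{3^k/k!} = \lim_{k\to\infty}\frac{3^{k+1}}{3^k}\cdot\frac{k!}{(k+1)!} = \lim_{k\to\infty}\frac{3}{k+1} = 0 < 1,$$

the Ratio Test tells us that

$$\sum_{k=1}^{\infty} c_k = \sum_{k=1}^{\infty}\frac{3^k}{k!} = 3 + \frac{3^2}{2!} + \frac{3^3}{3!} + \frac{3^4}{4!} + \cdots$$

converges.


This series may look familiar. In fact, our result from Section 4.4 that $e^x$ equals its Maclaurin series tells us that

$$e^3 = \sum_{k=0}^{\infty}\frac{3^k}{k!} = 1 + \sum_{k=1}^{\infty}\frac{3^k}{k!},$$

and, thus, not only does $\sum_{k=1}^{\infty}\frac{3^k}{k!}$ converge, but it converges to $e^3 - 1$.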


ede = Seale 


Since 


case c) of the Ratio Test, Theorem 4.5.13, tells us that, at this point, we cannot conclude whether
$\sum_{k=1}^{\infty} d_k$ converges or diverges.


On the other hand, case 2) of the Ratio Test applies, since, for all $k \geq 1$,


de+1 
dk 


1 
= 1 eee 
| 


Therefore, 


eel. ee es eee 


diverges. 


• $e_k = \dfrac{1}{k}$ and $f_k = \dfrac{(-1)^{k+1}}{k}$:

The terms of these two series have the same absolute value and, hence, both of these series
lead to the same ratio in the Ratio Test:

$$\lim_{k\to\infty}\frac{1/(k+1)}{1/k} = \lim_{k\to\infty}\frac{k}{k+1} = \lim_{k\to\infty}\frac{(k+1)-1}{k+1} = \lim_{k\to\infty}\left(1 - \frac{1}{k+1}\right) = 1.$$

Once again, case c) of the Ratio Test, Theorem 4.5.13, tells us that, at this point, we cannot
conclude whether the two series converge or diverge.

This time, case 2) of the Ratio Test does not apply, since, for all $k \geq 1$,

$$\frac{1/(k+1)}{1/k} = 1 - \frac{1}{k+1} < 1.$$

You might hope that case 1) of the Ratio Test would allow us to conclude the convergence of
both of the series, but, in fact, there is no r < 1 so that, for arbitrarily large k, $1 - 1/(k+1)$ is
always less than or equal to r; as we mentioned in Remark 4.5.14, no such r can exist precisely
because $\lim_{k\to\infty}(1 - 1/(k+1)) = 1$.


Actually, as we discussed in Remark 4.4.13, the series

$$\sum_{k=1}^{\infty}\frac{(-1)^{k+1}}{k} = 1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \cdots$$

is called the alternating harmonic series, and converges to ln 2.


However, the harmonic series

$$\sum_{k=1}^{\infty}\frac{1}{k} = 1 + \frac{1}{2} + \frac{1}{3} + \frac{1}{4} + \cdots$$

diverges to $\infty$, as we shall see in Proposition 5.2.16.
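The ratios in this example can also be tabulated numerically, which gives a quick (though non-rigorous) indication of what the limit L is likely to be. In the sketch below, the helper `ratio` and the dictionary labels are illustrative assumptions; the series included are the ones whose terms are explicitly identified in the discussion above.

```python
import math


def ratio(term, k):
    """Return |b_{k+1} / b_k| for the sequence k -> term(k)."""
    return abs(term(k + 1) / term(k))


# Terms of several of the series discussed above, indexed by k >= 1.
series = {
    "1/3^k": lambda k: 1 / 3**k,
    "3^k": lambda k: 3**k,
    "3^k/k!": lambda k: 3**k / math.factorial(k),
    "1/k": lambda k: 1 / k,
    "(-1)^(k+1)/k": lambda k: (-1) ** (k + 1) / k,
}

for name, term in series.items():
    print(name, [round(ratio(term, k), 4) for k in (1, 10, 100)])
```

Note that the last two rows show ratios creeping up toward 1 from below, which is exactly the situation in which the Ratio Test gives no conclusion.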


Great. So now we have the Ratio Test, and we’ve looked at a few examples of how to use it.
But, what does this have to do with power series? Another extended example should make it
very clear how the Ratio Test allows you to find the radius of convergence of many power series.


Example 4.5.17. Let’s see what the Ratio Test tells us about the radius of convergence of each
of the following power series:


k 2k 


[eve) (oe) co 4 3 2k ge ay —5 k 
Sr LOS ae Schl, and ys 
k=0 k=0 


When we apply the Ratio Test to each series, we do so with the variable x in the test, and 
we obtain restrictions on x which guarantee convergence. We always explicitly, or implicitly, 
assume that x is not the center of the power series while applying the Ratio Test; we know that 
all power series converge at their centers, for every term becomes zero, except for the constant 
term. Of course, every term, other than the constant term, being zero causes a problem in the 
Ratio Test, since then the quotients in the Ratio Test are not defined for large values of k. Thus, 


we assume we’re not taking x to be the center. 


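• $\displaystyle\sum_{k=0}^{\infty} \frac{x^k}{k!}$

We apply the Ratio Test with $b_k = x^k/k!$ (assuming that $x \neq 0$), and find

\[ L = \lim_{k\to\infty} \left| \frac{b_{k+1}}{b_k} \right| = \lim_{k\to\infty} \left| \frac{x^{k+1}/(k+1)!}{x^k/k!} \right| = \lim_{k\to\infty} \left| \frac{x^{k+1}}{x^k} \cdot \frac{k!}{(k+1)!} \right| = \lim_{k\to\infty} \frac{|x|}{k+1} = 0 < 1, \]

regardless of the value of $x$. That is, no matter what number we put in for $x$, the Ratio Test limit comes out less than 1; this means that the power series converges for all $x$. Therefore, the interval of convergence is $(-\infty, \infty)$ and the radius of convergence is $R = \infty$.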


Of course, we knew this already; this power series is the Maclaurin series of $e^x$, and converges to $e^x$, for all $x$. But it's nice to know that we can use the Ratio Test to conclude that the series converges for all $x$, without having to know ahead of time that the series equals $e^x$.


• $\displaystyle\sum_{k=0}^{\infty} \frac{(-1)^k x^{2k}}{(2k)!}$

Hopefully, you recognize this as the Maclaurin series for $\cos x$; hence, we know that the power series converges to $\cos x$, for all $x$, but let's see if the Ratio Test will allow us to conclude that the interval of convergence is $(-\infty, \infty)$.


Note that it's extremely convenient that the indexing of the terms of this power series is designed to hit only the non-zero terms, i.e., even though the coefficients of all of the odd powers of $x$ are zero in this series, our indexing skips those zero terms, and only adds the even powers of $x$. Skipping the zero terms makes the ratios in the Ratio Test defined, since we don't divide by zero.


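We apply the Ratio Test and calculate

\[ L = \lim_{k\to\infty} \left| \frac{(-1)^{k+1} x^{2(k+1)}/(2(k+1))!}{(-1)^k x^{2k}/(2k)!} \right| = \lim_{k\to\infty} \left| \frac{x^{2k+2}}{x^{2k}} \cdot \frac{(2k)!}{(2k+2)!} \right| = x^2 \cdot \lim_{k\to\infty} \frac{1}{(2k+2)(2k+1)} = 0 < 1, \]

regardless of the value of $x$. So, yes, once again the Ratio Test allows us to conclude that the interval of convergence is the entire real line $(-\infty, \infty)$, and so the radius of convergence is $\infty$.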


• $\displaystyle\sum_{k=0}^{\infty} \left(\frac{x+3}{7}\right)^k = \sum_{k=0}^{\infty} \frac{(x+3)^k}{7^k}$

We apply the Ratio Test and calculate

\[ L = \lim_{k\to\infty} \left| \frac{(x+3)^{k+1}/7^{k+1}}{(x+3)^k/7^k} \right| = \lim_{k\to\infty} \frac{|x+3|}{7} = \frac{|x+3|}{7}. \]

Now, the Ratio Test tells us that the series converges if $|x+3|/7 < 1$ and diverges if $|x+3|/7 > 1$. Multiplying by 7, we find that the series, which is centered at $-3$, converges if $|x - (-3)| < 7$ and diverges if $|x - (-3)| > 7$; looking back at Definition 4.5.10, you can see that this tells us that the radius of convergence of the series is 7.


Thus, the largest open interval on which the series converges is the interval $(-3-7, -3+7) = (-10, 4)$, but this easy use of the Ratio Test does not tell you what happens at the endpoints of this interval, for the endpoints are exactly where the limit in the Ratio Test comes out to equal 1. This specific example shows you what happens in general: when you apply the Ratio Test, Theorem 4.5.13, simply calculating the limit $L$ in cases a), b), and c) can never tell you about convergence/divergence at the endpoints of the interval of convergence.


However, in this particular example, case 2) of the Ratio Test allows us to conclude that the series diverges at both endpoints of the interval of convergence. This isn't terribly surprising; the proof of the Ratio Test uses that we know exactly what happens for geometric series and, while it's slightly disguised, the series $\sum_{k=0}^{\infty} \left(\frac{x+3}{7}\right)^k$ is, in fact, geometric.


How do you see this? Well, the most basic geometric series, when $a = 1$ in Theorem 4.4.10, is

\[ \frac{1}{1-x} = \sum_{k=0}^{\infty} x^k, \]

where the equality holds for $|x| < 1$, and the series diverges for $|x| \geq 1$. Replacing the $x$ everywhere with the quantity $(x+3)/7$, we find that

\[ \frac{1}{1 - \frac{x+3}{7}} = \sum_{k=0}^{\infty} \left(\frac{x+3}{7}\right)^k, \]

where the equality holds for $\left|\frac{x+3}{7}\right| < 1$, and the series diverges for $\left|\frac{x+3}{7}\right| \geq 1$. Note that the convergence/divergence is exactly what we found by using the Ratio Test.
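As a rough numerical check of this geometric-series identity, the sketch below compares partial sums with the closed form at points inside (-10, 4) and looks at the endpoint x = 4. The function names are hypothetical and are introduced only for this illustration.

```python
def geometric_partial_sum(x, n):
    """Sum of ((x + 3) / 7)**k for k = 0, ..., n."""
    ratio = (x + 3) / 7
    return sum(ratio ** k for k in range(n + 1))

def closed_form(x):
    """1 / (1 - (x + 3)/7); equals the sum of the series only when |x + 3| < 7."""
    return 1 / (1 - (x + 3) / 7)

if __name__ == "__main__":
    for x in (0.0, -9.0, 3.5):
        print(x, geometric_partial_sum(x, 400), closed_form(x))
    # At the endpoint x = 4 every term equals 1, so the partial sums grow without bound.
    print(geometric_partial_sum(4.0, 99))
```

Inside the interval the two columns agree to many digits; at x = 4 the partial sum of the first n + 1 terms is simply n + 1.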


• $\displaystyle\sum_{k=0}^{\infty} k!\,x^k$

We calculate the limit $L$ from the Ratio Test:

\[ L = \lim_{k\to\infty} \left| \frac{(k+1)!\,x^{k+1}}{k!\,x^k} \right| = \lim_{k\to\infty} (k+1)|x|. \]

If $x \neq 0$, this limit is $\infty$, which is greater than 1, regardless of the value of $x$. Thus, the Ratio Test tells us that this series converges only at the center $x = 0$ or, equivalently, that the radius of convergence is 0.


• $\displaystyle\sum_{k=1}^{\infty} \frac{(x-5)^k}{3^k k^2}$

We calculate the limit $L$ from the Ratio Test:

\[ L = \lim_{k\to\infty} \left| \frac{(x-5)^{k+1}/\left(3^{k+1}(k+1)^2\right)}{(x-5)^k/\left(3^k k^2\right)} \right| = \lim_{k\to\infty} \left| \frac{(x-5)^{k+1}}{(x-5)^k} \cdot \frac{3^k}{3^{k+1}} \cdot \frac{k^2}{(k+1)^2} \right| = \frac{|x-5|}{3} \cdot \lim_{k\to\infty} \left(\frac{k}{k+1}\right)^2. \]

Dividing by $k$ in the numerator and denominator inside the parentheses, we find

\[ L = \frac{|x-5|}{3} \cdot \lim_{k\to\infty} \left(\frac{1}{1 + \frac{1}{k}}\right)^2 = \frac{|x-5|}{3}. \]

Therefore, the series converges when $|x-5|/3 < 1$ and diverges when $|x-5|/3 > 1$, i.e., converges when $|x-5| < 3$ and diverges when $|x-5| > 3$. Hence, the radius of convergence is 3, and the largest open interval on which the series converges is the interval $(5-3, 5+3) = (2, 8)$.

We leave it as an exercise for you to show that even the more-complicated parts of the Ratio Test do not allow you to conclude anything about convergence/divergence when $x = 2$ and $x = 8$.


One final note on applying the Ratio Test to this last series: as we saw, it can be very 
helpful to split up the quotient that appears in the Ratio Test into a product of quotients of 
similar-looking terms. The calculation of the limits frequently works out nicely when you do 
this. 


Remark 4.5.18. You may have figured out from the above examples how to calculate the radius of convergence the “easy way”; the Ratio Test implies that, if you have the infinite series $\sum_{k=0}^{\infty} c_k (x-a)^k$, and

\[ \lim_{k\to\infty} \left| \frac{c_k}{c_{k+1}} \right| = R, \]

where $R$ is an extended real number, then $R$ is the radius of convergence of the power series.
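The “easy way” can be explored numerically by evaluating the quotient of consecutive coefficients for a large index. This is only a sketch under stated assumptions: the helper names below are hypothetical, exact fractions are used to avoid floating-point underflow, and a single large value of k merely suggests the limit rather than establishing it.

```python
import math
from fractions import Fraction

def ratio_estimate(coeff, k):
    """Return |c_k / c_(k+1)| exactly, for a coefficient function coeff(k)."""
    return abs(coeff(k) / coeff(k + 1))

# Coefficients of three of the series from Example 4.5.17.
def c_exp(k):       # coefficient of x^k in the series with terms x^k / k!
    return Fraction(1, math.factorial(k))

def c_seventh(k):   # coefficient of (x + 3)^k in the series with terms (x + 3)^k / 7^k
    return Fraction(1, 7 ** k)

def c_last(k):      # coefficient of (x - 5)^k in the series with terms (x - 5)^k / (3^k k^2)
    return Fraction(1, 3 ** k * k ** 2)

if __name__ == "__main__":
    print(ratio_estimate(c_exp, 50))            # grows like k + 1, consistent with R = infinity
    print(ratio_estimate(c_seventh, 20))        # constant, consistent with R = 7
    print(float(ratio_estimate(c_last, 1000)))  # close to 3, consistent with R = 3
```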


Example 4.5.19. Consider the last series from Example 4.5.17, $\sum_{k=1}^{\infty} \frac{(x-5)^k}{3^k k^2}$.

We calculate

\[ \lim_{k\to\infty} \left| \frac{1/\left(3^k k^2\right)}{1/\left(3^{k+1}(k+1)^2\right)} \right| = \lim_{k\to\infty} \frac{3^{k+1}}{3^k} \cdot \frac{(k+1)^2}{k^2} = 3, \]

as we found in Example 4.5.17.

It's important, when using the method of Remark 4.5.18, that the limit $\lim_{k\to\infty} \left| \frac{c_k}{c_{k+1}} \right|$ exists, at least as an extended real number, and that $c_k$ is the coefficient of $(x-a)^k$, i.e., that you're looking at all of the powers of $(x-a)$.


Consider, for instance, the series

\[ \sum_{k=0}^{\infty} \frac{(x+7)^{2k}}{9^k}. \]

Note that the odd terms are missing. It might be tempting to think that the radius of convergence can be calculated from the coefficients of the non-zero terms, and look at

\[ \lim_{k\to\infty} \left| \frac{1/9^k}{1/9^{k+1}} \right| = 9. \]

However, the radius of convergence is not 9. Why not?

You can apply the Ratio Test to the series and determine that the series converges when $\frac{|x+7|^2}{9} < 1$ and diverges when $\frac{|x+7|^2}{9} > 1$. Therefore, the radius of convergence is $\sqrt{9} = 3$.
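The Ratio Test computation behind this conclusion can be written out as follows. This is a sketch that assumes the series in question has general term $b_k = (x+7)^{2k}/9^k$, which is what the coefficients $1/9^k$ and the condition on $|x+7|^2/9$ indicate; the symbol $b_k$ is introduced here only for the calculation.

```latex
\[
L \;=\; \lim_{k\to\infty} \left| \frac{b_{k+1}}{b_k} \right|
  \;=\; \lim_{k\to\infty} \left| \frac{(x+7)^{2k+2}}{9^{k+1}} \cdot \frac{9^k}{(x+7)^{2k}} \right|
  \;=\; \frac{|x+7|^2}{9}.
\]
\[
L < 1 \iff |x+7|^2 < 9 \iff |x+7| < 3 .
\]
```

The square appears because consecutive non-zero terms differ by two powers of $(x+7)$, not one, which is exactly why the coefficient quotient alone gives 9 rather than the radius 3.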


In general, it is not particularly interesting to have functions defined by power series that 
converge only at their centers; such a function has a single point in its domain and, at that 


single point, the function equals the constant term in the series. Not too useful. 


Therefore, our interest is really in functions which equal power series on a non-empty open 
interval around the center a of the series. As we shall see in the next section, this necessarily 


means that the function equals its Taylor series, centered at a. We give a name to such functions. 


Definition 4.5.20. Suppose that $f$ is an infinitely differentiable function on an open subset $U$ of $\mathbb{R}$. Let $V$ be an open subset of $U$, and let $a$ be a point in $U$.

Then, $f(x)$ is real analytic at $a$ provided that, for all $x$ in some open interval around $a$, $T_f^{\infty}(x; a)$ converges to $f(x)$. This is equivalent to saying that $f(x)$ is equal to some (convergent) power series, centered at $a$, on an open interval containing $a$.

The function $f$ is said to be real analytic on $V$ provided that $f$ is real analytic at $x$, for all $x$ in $V$. A function which is real analytic on its whole domain is simply called a real analytic function.


Example 4.5.21. Suppose that we have a power series $p(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$, centered at $a$, with a positive radius of convergence $R > 0$. Then, the function defined by $p(x)$, on the interval of convergence of $p(x)$, is, by definition, real analytic at $a$, since $p(x)$ equals the convergent power series $\sum_{k=0}^{\infty} c_k (x-a)^k$ on (at least) the non-empty open interval $(a-R, a+R)$.

Remark 4.5.22. There is a subtle, but extremely important, point in the definition of a function being real analytic on an open subset or, more specifically, on an open interval.

Suppose that $f$ is real analytic at $a$. This means that there exists an open interval $I$, around $a$, such that, for all $x$ in $I$, $f(x) = T_f^{\infty}(x; a)$ or, equivalently, that there is some power series $\sum_{k=0}^{\infty} c_k (x-a)^k$ such that $f(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$ on a non-empty open interval $I$ around $a$.


The question is: does this mean that $f$ is real analytic on $I$, i.e., if $b$ is in $I$, must it be true that $f$ is real analytic at $b$?

Your tendency might be to say “Sure: $f$ equals its Taylor series for all $x$ in $I$, and $b$ is in $I$, so $f$ equals its Taylor series at $b$”.

The problem here is that what we mean by “its Taylor series” should not be the same both times in the sentence above. The first time that we wrote “its Taylor series”, we meant “its Taylor series centered at $a$”. However, $f$ being real analytic at $b$ means that the Taylor series of $f$, centered at $b$, converges to $f(x)$ for all $x$ in an open interval around $b$. In other words, being real analytic at different points requires you to switch the center of your Taylor/power series to match the point under consideration, and then you need that the Taylor series $T_f^{\infty}(x; b)$ around this (possibly) new center $b$ also converges to $f(x)$ on some open interval around $b$; this interval around $b$ need not be the same open interval $I$ on which $T_f^{\infty}(x; a)$ converged.


Fortunately, there is a theorem, Theorem 8.4 of [3], which tells us that, in fact, we don’t 
really have to worry about the subtle point discussed above. 


Theorem 4.5.23. Let $R$ be a positive extended real number. Suppose that $T_f^{\infty}(x; a)$ converges to $f(x)$ for all $x$ in the open interval $I$ defined by $|x-a| < R$. Then, for all $b$ in $I$, $T_f^{\infty}(x; b)$ converges to $f(x)$ for all $x$ such that $|x-b| < R - |b-a|$, that is, for all $x$ in the largest open interval, centered at $b$, which is contained in $I$.

As a special case, if $T_f^{\infty}(x; a)$ converges to $f(x)$ for all real numbers $x$, then so does $T_f^{\infty}(x; b)$, for every real number $b$.

In particular, if $T_f^{\infty}(x; a)$, or any power series $\sum_{k=0}^{\infty} c_k (x-a)^k$, converges to $f(x)$ for all $x$ in an open interval $I$, centered at $a$, then $f$ is real analytic on $I$.


Example 4.5.24. We know that $e^x$, $\sin x$, and $\cos x$ are equal to their Maclaurin series, for all $x$. Theorem 4.5.23 tells us that being equal to their Maclaurin series on the interval $(-\infty, \infty)$ in fact implies that, on the entire real line, these functions are equal to their Taylor series centered anywhere. Of course, we could have determined this by showing that the Taylor error, with an arbitrary center, approaches zero, but Theorem 4.5.23 gives it to us without any additional work.


Example 4.5.25. Let's look at a function which has a Maclaurin series with a positive, finite radius of convergence. Consider the function $f(x) = \ln(1+x)$, for all $x > -1$. Is this function real analytic on the entire interval $(-1, \infty)$?

Your immediate reaction may be to say “no; $\ln(1+x)$ equals its Maclaurin series only on the interval $(-1, 1]$”. This is true, but, somewhat surprisingly, irrelevant. The question of being real analytic on the interval $(-1, \infty)$ is: for all $a > -1$, is it true that $\ln(1+x)$ equals its Taylor series centered at $a$ on some open interval around $a$? That is, we allow the center of the series under consideration to change. Being real analytic does not require being equal to a single power series everywhere. Yes, being equal to a single power series on an open interval implies being real analytic; this follows from Theorem 4.5.23. But a function can be real analytic on an interval and “require” you to use Taylor series with different centers to represent the function and see the real analyticity.

If we take as given (even though we haven't proved it yet) that $\ln(1+x)$ is real analytic on the open interval $(-1, 1)$, then, if we show that, for all $a > 0$, $\ln(1+x)$ is real analytic at $a$, we will have shown that $\ln(1+x)$ is real analytic on the interval $(-1, \infty)$. Let's do this.


Suppose that $a > 0$, and let $f(x) = \ln(1+x)$. As we saw earlier, in the proof of Theorem 4.4.12, for $k \geq 1$, $f^{(k)}(x) = (-1)^{k-1}(k-1)!\,(1+x)^{-k}$.

Therefore, if $x \neq a$, for $n \geq 1$, there exists a $c$, strictly between $x$ and $a$, such that the Taylor error satisfies the following equality:

\[ \left| E_f^n(x; a) \right| = \left| \frac{f^{(n+1)}(c)}{(n+1)!} (x-a)^{n+1} \right| = \frac{n!}{(n+1)!} \left| \frac{x-a}{1+c} \right|^{n+1} = \frac{1}{n+1} \left| \frac{x-a}{1+c} \right|^{n+1}. \]


What we want to do is produce an $R > 0$ (not necessarily the radius of convergence) such that, if $|x-a| < R$, then $|x-a|/|1+c| < 1$, for then the limit $\lim_{n\to\infty} E_f^n(x; a)$ will be 0, which would show that $\ln(1+x)$ equals its Taylor series at $a$ on the non-empty open interval $(a-R, a+R)$.

But this is easy; let $R$ be the minimum of the two values 1 and $a$, i.e., $R = \min\{a, 1\}$. Then $R \leq a$ and $R \leq 1$. Suppose now that $|x-a| < R$, i.e., suppose that $a - R < x < a + R$. Then, certainly, $|x-a| < 1$. In addition, since $R \leq a$, $a - R \geq 0$ and we must have $x > 0$, which, together with $a > 0$, implies that $c > 0$, so that $0 < 1/(1+c) < 1$. Therefore, $|x-a|/|1+c| < 1$, and so the Taylor error approaches 0.

Thus, we have shown that $\ln(1+x)$ is real analytic on the interval $(0, \infty)$, which, when combined with the real analyticity on $(-1, 1)$, shows that $\ln(1+x)$ is real analytic on the entire interval $(-1, \infty)$.
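The argument above can be sanity-checked numerically. From the derivative formula, the Taylor coefficients of ln(1 + x) centered at a are ln(1 + a) for k = 0 and (-1)^(k-1) / (k (1 + a)^k) for k ≥ 1. The sketch below (the function name is hypothetical) evaluates the Taylor polynomial at points with |x - a| < min{a, 1} and compares it with the logarithm.

```python
import math

def taylor_ln1p(x, a, n):
    """Degree-n Taylor polynomial of ln(1 + x), centered at a > -1, evaluated at x."""
    total = math.log(1 + a)
    for k in range(1, n + 1):
        # k-th coefficient is f^(k)(a) / k! = (-1)^(k-1) / (k (1 + a)^k)
        total += (-1) ** (k - 1) / k * ((x - a) / (1 + a)) ** k
    return total

if __name__ == "__main__":
    # a = 2 gives R = min{2, 1} = 1; a = 0.5 gives R = 0.5.
    for a, x in ((2.0, 2.9), (2.0, 1.2), (0.5, 0.9)):
        print(a, x, taylor_ln1p(x, a, 60), math.log(1 + x))
```

Note that 2.9 lies outside (-1, 1], where the Maclaurin series is of no use, yet the series centered at 2 reproduces ln(3.9) to many digits.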


The following theorem tells us that, if a real analytic function on an open interval can be extended, a.k.a., continued, to a real analytic function on a larger open interval, then there's only one way to do it. The analog of this theorem in the setting of complex numbers is well-known, but it is difficult to find many references for this fact in the real numbers; we prove the result in Theorem 4.A.1.


Theorem 4.5.26. (Real Analytic Continuation) Suppose that $f$ and $g$ are real analytic functions on an open interval $I$, and that $f(x) = g(x)$ for all $x$ in an open interval $J$, which is contained in $I$. Then, $f$ and $g$ are equal on $I$.


Remark 4.5.27. You may think that Theorem 4.5.26 is obvious, and must surely be true for any “nice” functions, like infinitely differentiable functions; surely we don't really need real analytic functions. Actually, we do.

Recall, from Example 4.4.11, that the following function is infinitely differentiable on the entire real line:

\[ g(x) = \begin{cases} e^{-1/x^2}, & \text{if } x > 0; \\ 0, & \text{if } x \leq 0. \end{cases} \]

Certainly the function $f(x) = 0$ is infinitely differentiable on the entire real line, and $f(x) = g(x)$ on the open interval $(-\infty, 0)$. Nonetheless, $f(x)$ and $g(x)$ are not equal at all points in the interval $(-\infty, \infty)$.

This does not contradict Theorem 4.5.26 since, as we saw in Example 4.4.11, the function $g(x)$ is not real analytic; more specifically, $g(x)$ is not real analytic at $x = 0$.
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4.5.1 Exercises 


In each of Exercises 1 through 6, determine whether the given sequence converges 
or diverges. Note that, as we are interested in what happens as n — co, we don’t 
bother specifying a starting value of the index. Assume that the indexing begins 


at a high enough value so that all of the terms are defined. 


1 _ 5n? + 3n 
- bn = —F a: 
5n? + 3(—1)” 
2. an = ———>- 
7n2 
_1)2y2 
ere ae 
7n2 
2 
i ar 
ni 


5. An = cos(nm/4). 


n2 


~ 1000n + 1° 


In each of Exercises 7 through 12, determine the convergence or divergence of the 


given series. 


°°. 1000 
> a 
k=1 


k=1 
7 > an g 
k=1 
= k24+1 
10 (-1)* mi 
k=1 : 
k=1 
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13. 


14. 


15. 


a. Find the radius of convergence of the power series 


. Given the center of the series and the radius of convergence from part (a), what are 


the possibilities for the interval of convergence of p(x)? 


. Use the first 3 terms of p(x) to estimate the values of p(1.1), p(1.01), and p(2). 


. Explain why, in this exercise, unlike in the exercises from the previous section, it is 


not possible (in any easy way) to compare your estimates in part (c) with the actual 


values of p(x) from your calculator. 


. Which of your estimates in part (c) do you expect to be good estimates of the actual 


values of p(x)? Why? 


. Find the radius of convergence of the power series 


\[ p(x) = \sum_{k=1}^{\infty} \sqrt{k}\,(x+3)^k = (x+3) + \sqrt{2}\,(x+3)^2 + \sqrt{3}\,(x+3)^3 + \sqrt{4}\,(x+3)^4 + \cdots. \]


. Given the center of the series and the radius of convergence from part (a), what are 


the possibilities for the interval of convergence of p(x)? 


. Use the first 3 terms of p(x) to estimate the values of p(−2.9), p(−2.99), and p(−2).


. Explain why, in this exercise, unlike in the exercises from the previous section, it is 


not possible (in any easy way) to compare your estimates in part (c) with the actual 


values of p(x) from your calculator. 


. Which of your estimates in part (c) do you expect to be good estimates of the actual 


values of p(x)? Why? 


. Find the radius of convergence of the power series 


2 3 4 


mes XL 1 x x x v 
— 1)* = 


. Given the center of the series and the radius of convergence from part (a), what are 


the possibilities for the interval of convergence of p(x)? 
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16. 


17. 


. Use the first 3 terms of p(x) to estimate the values of p(0.1), p(0.01), and p(1). 


. Explain why, in this exercise, unlike in the exercises from the previous section, it is 


not possible (in any easy way) to compare your estimates in part (c) with the actual 


values of p(x) from your calculator. 


. Which of your estimates in part (c) do you expect to be good estimates of the actual 


values of p(x)? Why? 


. Find the radius of convergence of the power series 


. Given the center of the series and the radius of convergence from part (a), what are 


the possibilities for the interval of convergence of p(x)? 


. Use the first 3 terms of p(x) to estimate the values of p(4.1), p(4.01), and p(5). 


. Explain why, in this exercise, unlike in the exercises from the previous section, it is 


not possible (in any easy way) to compare your estimates in part (c) with the actual 


values of p(x) from your calculator. 


. Which of your estimates in part (c) do you expect to be good estimates of the actual 


values of p(x)? Why? 


. Find the radius of convergence of the power series 


o. (a — 4)" 
pa) = 


k=0 


. Given the center of the series and the radius of convergence from part (a), what are 


the possibilities for the interval of convergence of p(x)? 


. Use the first 3 terms of p(x) to estimate the values of p(4.1), p(4.01), and p(5). 


. Explain why, in this exercise, unlike in the exercises from the previous section, it is 


not possible (in any easy way) to compare your estimates in part (c) with the actual 


values of p(x) from your calculator. 


. Which of your estimates in part (c) do you expect to be good estimates of the actual 


values of p(x)? Why? 
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18. 


19. 


20. 


. Find the radius of convergence of the power series 


=r ae 
k=1 


. Given the center of the series and the radius of convergence from part (a), what are 


the possibilities for the interval of convergence of p(x)? 


. Use the first 3 terms of p(x) to estimate the values of p(4.1), p(4.01), and p(5). 


. Explain why, in this exercise, unlike in the exercises from the previous section, it is 


not possible (in any easy way) to compare your estimates in part (c) with the actual 


values of p(x) from your calculator. 


. Which of your estimates in part (c) do you expect to be good estimates of the actual 


values of p(x)? Why? 


. Find the radius of convergence of the power series 


eye yet (e+7)* 
kk 
k=1 


. Given the center of the series and the radius of convergence from part (a), what are 


the possibilities for the interval of convergence of p(x)? 


. Use the first 3 terms of p(x) to estimate the values of p(—6.9), p(—6.99), and p(—6). 


. Explain why, in this exercise, unlike in the exercises from the previous section, it is 


not possible (in any easy way) to compare your estimates in part (c) with the actual 


values of p(x) from your calculator. 


. Which of your estimates in part (c) do you expect to be good estimates of the actual 


values of p(x)? Why? 


. Find the radius of convergence of the power series 


. Given the center of the series and the radius of convergence from part (a), what are 


the possibilities for the interval of convergence of p(x)? 
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21. 


. Use the first 3 terms of p(x) to estimate the values of p(—6.9), p(—6.99), and p(—6). 


Explain why, in this exercise, unlike in the exercises from the previous section, it is 
not possible (in any easy way) to compare your estimates in part (c) with the actual 
values of p(x) from your calculator. 

Which of your estimates in part (c) do you expect to be good estimates of the actual 
values of p(x)? Why? 


Find the radius of convergence of the power series 


oo k 
rz) = a 
p(x) 2! 1) Ti 


. Given the center of the series and the radius of convergence from part (a), what are 


the possibilities for the interval of convergence of p(x)? 


. Use the first 3 terms of p(x) to estimate the values of p(0.1), p(0.01), and p(1). 


. Explain why, in this exercise, unlike in the exercises from the previous section, it is 


not possible (in any easy way) to compare your estimates in part (c) with the actual 


values of p(x) from your calculator. 


. Which of your estimates in part (c) do you expect to be good estimates of the actual 


values of p(x)? Why? 


22. Suppose that a power series $p(x)$ converges at $x = 1$ and at $x = 3$, but diverges at $x = -2$ and $x = 5$. What can you say about the center of the power series and the radius of convergence?

23. Suppose that someone tells you that they have a power series which converges at $x = 1$ and at $x = 3$, but diverges at $x = 2$. What can you conclude?

24. In this exercise, you will show that $f(x) = \dfrac{1}{1-x}$ is real analytic at all points $a$, except for $a = 1$, i.e., $f$ is real analytic on the union of open intervals $(-\infty, 1) \cup (1, \infty)$.

a. Find a formula for the $k$-th derivative, $f^{(k)}(x)$.

b. Give Lagrange's form of the error for $E_f^n(x; a)$, where $a \neq 1$.

c. For each $a \neq 1$, find an $r > 0$ such that, for all $x$ in the open interval $(a-r, a+r)$, the limit of your error from part (b) is zero, i.e., such that $1/(1-x) = T_f^{\infty}(x; a)$ for all $x$ in $(a-r, a+r)$. (Hint: Choose $r < |1-a|/2$.)

d. Explain why $1/(1-x)$ being real analytic everywhere, except at 1, doesn't contradict the fact that the Maclaurin series for $1/(1-x)$ converges if and only if $|x| < 1$.
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> 
4.6 Power Series as Functions II: 


Operations on Power Series 


In the previous section, we discussed what it means for a power series to define a function. We looked at the interval and radius of convergence, we stated the Ratio Test, and looked at examples of using the Ratio Test to determine radii of convergence or, in some cases, intervals of convergence. We also discussed real analytic functions, functions which are locally equal, on open intervals, to their Taylor series at each point.


In this section, we will look at why power series functions are so nice to deal with. We will 
see that, as far as differentiation, integration, and algebraic operations are concerned, power 


series can be treated like polynomials, polynomials that just don’t ever end. 


For a number of technical proofs in this section, we refer you to the excellent real analysis 
book of Trench, [4]. 


We will begin with the following basic theorem on continuity, due to Abel. 


Theorem 4.6.1. Power series functions are continuous, i.e., continuous on their intervals of convergence, including at possible endpoints that are included in the intervals of convergence.


Proof. See [4], Theorem 4.5.12. 


Remark 4.6.2. As we shall see below, power series functions are differentiable on the interior of the interval of convergence, i.e., on the open interval $(a-R, a+R)$, where $a$ is the center and $R$ is the radius of convergence. As differentiable functions are continuous, the main point of Theorem 4.6.1, for us, is that, if a power series $p(x)$ converges at one or both of the endpoints of the interval of convergence, then the one-sided limits of $p(x)$ as you approach the endpoint equal the value of $p(x)$ at the endpoint.


Why is this important? Suppose, for instance, that we have a function $f(x)$ which we know is continuous on the interval $(-1, 1]$ and we have a power series $p(x)$ which we know converges on the same half-open interval $(-1, 1]$. Suppose, further, that we know that, on the open interval $(-1, 1)$, $f(x) = p(x)$. Then, we can immediately conclude that, in fact, $f(x) = p(x)$ on the entire half-open interval $(-1, 1]$. Why? Because Abel's theorem, Theorem 4.6.1, tells us that $p(x)$ is also continuous on the interval $(-1, 1]$, and so

\[ f(1) = \lim_{x\to 1^-} f(x) = \lim_{x\to 1^-} p(x) = p(1), \]

where the middle equality follows from the fact that $f(x) = p(x)$ on the interval $(-1, 1)$.


For example, below, we shall use integration of power series to show that, for all $x$ in the open interval $(-1, 1)$,

\[ \ln(1+x) = \sum_{k=1}^{\infty} \frac{(-1)^{k+1} x^k}{k} = x - \frac{x^2}{2} + \frac{x^3}{3} - \frac{x^4}{4} + \cdots. \tag{4.12} \]

Looking ahead to results in Chapter 5, we find that the series on the right, above, converges when $x = 1$ and diverges when $x = -1$. As $\ln(1+x)$ is continuous, and Abel's theorem tells us that the power series is also continuous, we conclude that

\[ \ln 2 = \sum_{k=1}^{\infty} \frac{(-1)^{k+1}}{k} = 1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \frac{1}{5} - \frac{1}{6} + \cdots. \]

In other words, the equality given in Formula 4.12, which was for $-1 < x < 1$, also holds when $x = 1$. This provides an alternative way of showing that the alternating harmonic series converges to $\ln 2$, something we saw back in Theorem 4.4.12 and Remark 4.4.13, using Taylor errors.
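The continuity at the endpoint can be illustrated numerically: as x approaches 1 from the left, the series values approach the value of the series at x = 1. This is a small sketch with a hypothetical function name; the number of terms needed grows as x nears 1, so the term counts below are chosen generously.

```python
import math

def ln1p_series(x, n):
    """Partial sum x - x^2/2 + x^3/3 - ... up to and including the x^n term."""
    return sum((-1) ** (k + 1) * x ** k / k for k in range(1, n + 1))

if __name__ == "__main__":
    for x, n in ((0.5, 100), (0.9, 1_000), (0.99, 5_000), (1.0, 100_000)):
        print(x, ln1p_series(x, n), math.log(1 + x))
```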


In the remainder of this section, we will sometimes assume that we have an interval $I$ on which a power series converges. This does not necessarily mean that $I$ is the entire interval of convergence; $I$ could, in fact, be a smaller interval inside the interval of convergence.


We want to look now at substituting into power series. Not all substitutions lead to new power series, but they give some infinite summation whose convergence or divergence can be discussed.

Example 4.6.3. Recall from Theorem that we have the following equality

\[ e^x = \sum_{k=0}^{\infty} \frac{x^k}{k!} = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \cdots, \]

for all $x$. Now, if $x$ is some number, then $5(x-1)^3$ is just some number, and we can stick it into Formula and obtain

\[ e^{5(x-1)^3} = 1 + 5(x-1)^3 + \frac{\left(5(x-1)^3\right)^2}{2!} + \frac{\left(5(x-1)^3\right)^3}{3!} + \frac{\left(5(x-1)^3\right)^4}{4!} + \cdots = 1 + 5(x-1)^3 + \frac{25(x-1)^6}{2!} + \frac{125(x-1)^9}{3!} + \frac{625(x-1)^{12}}{4!} + \cdots. \]

Hence, we obtain a power series, centered at 1, which converges to $e^{5(x-1)^3}$ for all $x$.

Is this series necessarily equal to the Taylor series, centered at 1, for $e^{5(x-1)^3}$? Yes, though we won't really “know” this until we have Corollary 4.6.8.
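A quick numerical comparison makes the substitution concrete. The sketch below (hypothetical function name) sums the first terms of the series in powers of (x - 1) and compares the result with a direct evaluation of the exponential.

```python
import math

def substituted_series(x, n):
    """Partial sum of 5^k (x - 1)^(3k) / k! for k = 0, ..., n."""
    return sum(5 ** k * (x - 1) ** (3 * k) / math.factorial(k) for k in range(n + 1))

if __name__ == "__main__":
    for x in (1.0, 1.5, 0.0):
        print(x, substituted_series(x, 60), math.exp(5 * (x - 1) ** 3))
```

Only every third power of (x - 1) appears, with coefficient 5^k / k! on (x - 1)^(3k).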


Generalizing the above example, we have: 


Theorem 4.6.4. Suppose that $p(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$ converges on an interval $I$, and we have a second function $f(x)$.

Then, for all $x$ such that $f(x)$ is in $I$, we have the following equality

\[ p(f(x)) = \sum_{k=0}^{\infty} c_k \left(f(x) - a\right)^k, \]

in which the infinite sum converges.

In particular, if $f(x) = r(x-b)^m + a$, where $r$ and $b$ are constants and $m > 0$ is an integer, then, for all $x$ such that $r(x-b)^m + a$ is in $I$, we have the following equality

\[ p\left(r(x-b)^m + a\right) = \sum_{k=0}^{\infty} \left(c_k r^k\right)(x-b)^{mk}, \]

in which the sum on the right is a convergent power series. In addition, the new interval of convergence consists precisely of those $x$ values such that $r(x-b)^m + a$ is in $I$.


Example 4.6.5. Consider the geometric series from Theorem 4.4.10, with $a = 1$:

\[ \frac{1}{1-x} = \sum_{k=0}^{\infty} x^k = 1 + x + x^2 + x^3 + x^4 + \cdots, \]

where the interval of convergence of the series, and the interval on which the equality holds, is the open interval $(-1, 1)$.

By substituting $-3(x+2)$ in for $x$, we immediately obtain that

\[ \frac{1}{1 + 3(x+2)} = \sum_{k=0}^{\infty} \left(-3(x+2)\right)^k = \sum_{k=0}^{\infty} (-1)^k 3^k (x+2)^k = 1 - 3(x+2) + 9(x+2)^2 - 27(x+2)^3 + \cdots, \]

where the equality holds, and the series converges, on precisely the interval of those $x$'s such that $-3(x+2)$ is in the interval $(-1, 1)$. To determine this interval in a more standard form, you need to manipulate the inequalities

\[ -1 < -3(x+2) < 1. \]

Divide through by $-3$, reversing the inequalities, to obtain

\[ -\frac{1}{3} < x + 2 < \frac{1}{3}. \]

Now, subtract 2 everywhere,

\[ -2 - \frac{1}{3} < x < -2 + \frac{1}{3}, \]

to find that the new interval of convergence is $(-7/3, -5/3)$, which is centered at $-2$ and has radius $1/3$.
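The inequality manipulation for a linear substitution r(x - b) + a can be mechanized. The following is a sketch only: the function name and its parameters are hypothetical, it handles just the case m = 1 with r ≠ 0 and an open interval (lo, hi), and it uses exact fractions so the endpoints come out as 7/3 and 5/3 rather than decimals.

```python
from fractions import Fraction

def linear_substitution_interval(lo, hi, r, b, a=0):
    """Open interval of x with lo < r*(x - b) + a < hi, assuming r != 0."""
    e1 = b + Fraction(lo - a) / r
    e2 = b + Fraction(hi - a) / r
    # Dividing by a negative r reverses the inequalities, so sort the endpoints.
    return (min(e1, e2), max(e1, e2))

if __name__ == "__main__":
    left, right = linear_substitution_interval(-1, 1, r=-3, b=-2)
    print(left, right)                            # endpoints of the new interval
    print((left + right) / 2, (right - left) / 2)  # center and radius
```

For the substitution in Example 4.6.5, the center and radius computed this way are -2 and 1/3.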


Example 4.6.6. Recall from Theorem that, for all $x$,

\[ \cos x = \sum_{k=0}^{\infty} \frac{(-1)^k x^{2k}}{(2k)!}. \]

Now, it is a standard trig identity that $\sin x = \cos\left(\frac{\pi}{2} - x\right)$, and $\frac{\pi}{2} - x = -\left(x - \frac{\pi}{2}\right)$ is the same when raised to an even power as $x - \frac{\pi}{2}$ to the same power. Therefore, we obtain

\[ \sin x = \sum_{k=0}^{\infty} \frac{(-1)^k \left(x - \frac{\pi}{2}\right)^{2k}}{(2k)!}. \]

Note that we obtained this same equality in Remark 4.4.8, using our calculation of the Taylor series for $\sin x$, centered at $\pi/2$, from Example 4.2.8.

You should find the last line of the example above to be interesting. In Remark 4.4.8, using Example 4.2.8, we found a power series, centered at $\pi/2$, which equals $\sin x$. In Example 4.6.6, we produced a power series, centered at $\pi/2$, which equals $\sin x$, by substituting $\frac{\pi}{2} - x$ into the Maclaurin series for $\cos x$. What we found is that, despite the fact that we obtained the power series in completely different ways, we arrived at the same power series.


This may leave you wondering: “given a particular center $a$, can you have two different power series centered at $a$ which converge on some open interval around $a$ and which define the same function?” Or, putting it another way: “if $p(x)$ equals a power series, centered at $a$, on some open interval around $a$, is that power series centered at $a$ unique?”

As we shall see in Corollary 4.6.8, for a fixed center, if a function equals a power series on an open interval around the center, then that power series is unique, for the given center. Of course, changing the center will change the power series.


The uniqueness of the power series representation of a function follows as a corollary to the 
theorem below, which tells us that you differentiate a power series as though it’s a polynomial, 


i.e., by using linearity and the Power Rule on each term. 


Theorem 4.6.7. (Differentiating Power Series) Let $R$ be the radius of convergence of $p(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$ and let $I$ denote the interval of convergence.

Then, the radius of convergence of the series $\sum_{k=1}^{\infty} k c_k (x-a)^{k-1}$, obtained from $p(x)$ by term-by-term differentiation, is also $R$ and, for all $x$ in the open interval $(a-R, a+R)$,

\[ p'(x) = \sum_{k=1}^{\infty} k c_k (x-a)^{k-1}. \]

If the series for $p(x)$ diverges at one or both of the endpoints of $I$, then so does the series for $p'(x)$. However, convergence of the series for $p(x)$ at an endpoint of $I$ does not necessarily imply convergence of the series for $p'(x)$ at that endpoint.


Proof. See Theorem 4.5.5 of [4]. 


Note that the new summation in Theorem 4.6.7 begins at $k = 1$, since the $k = 0$ term $c_0$ in $p(x)$ has 0 for its derivative. 


After you apply Theorem 4.6.7 to a series $p(x)$, you have a new power series for $p'(x)$, which again converges for all $x$ in the same open interval $(a-R, a+R)$. Thus, you can differentiate $p'(x)$ and its power series, term by term, to produce a new power series for $p''(x)$. You can iterate this process any number of times, $n$, in order to produce a power series for $p^{(n)}(x)$; this power series will still be centered at $a$, converge to $p^{(n)}(x)$ on the open interval $(a-R, a+R)$, and have as its constant term $n!\,c_n$. The real proof of this last fact is by induction, but it is easy to see by writing out the first few derivatives. 

$$p(x) = c_0 + c_1(x-a) + c_2(x-a)^2 + c_3(x-a)^3 + c_4(x-a)^4 + c_5(x-a)^5 + \cdots.$$

$$p'(x) = c_1 + 2c_2(x-a) + 3c_3(x-a)^2 + 4c_4(x-a)^3 + 5c_5(x-a)^4 + \cdots.$$

$$p''(x) = 2c_2 + 3\cdot 2\,c_3(x-a) + 4\cdot 3\,c_4(x-a)^2 + 5\cdot 4\,c_5(x-a)^3 + \cdots.$$

$$p'''(x) = 3\cdot 2\,c_3 + 4\cdot 3\cdot 2\,c_4(x-a) + 5\cdot 4\cdot 3\,c_5(x-a)^2 + \cdots.$$

$$p^{(4)}(x) = 4\cdot 3\cdot 2\,c_4 + 5\cdot 4\cdot 3\cdot 2\,c_5(x-a) + \cdots.$$


Now, the constant term of a power series is exactly what you get when you evaluate the series at the center, $x = a$, since then all of the $(x-a)$'s are 0. Thus, for all $n \geq 0$, $p^{(n)}(a) = n!\,c_n$. At this point, we can write $k$ in place of $n$, if we want. 


Thus, we obtain: 


Corollary 4.6.8. Suppose that $p(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$ converges on an open interval $I$, which contains $a$. 

Then, $p(x)$ is infinitely differentiable on $I$ and, for all $k$, $c_k = \dfrac{p^{(k)}(a)}{k!}$ and, hence, the original power series must, in fact, be the Taylor series for $p(x)$, centered at $a$. 

In particular, if a function equals a power series, centered at $a$, on an open interval around $a$, then that power series, centered at $a$, is unique. 


Remark 4.6.9. You should have learned in high school that, if two polynomials are equal, then 
all of the coefficients of the various powers of x must be the same for the two polynomials. It 
may not have been clear to you at the time, but what they were telling you was that, if two 
polynomials define the same function, then the coefficients of each power of x must be the 


same, i.e., the abstract algebraic polynomials must be the same. 
The uniqueness statement in Corollary 4.6.8 is a generalization of this. It tells you that, if you have an equality of power series functions 

$$\sum_{k=0}^{\infty} b_k (x-a)^k = \sum_{k=0}^{\infty} c_k (x-a)^k,$$

for all $x$ in an open interval around $a$, then, in fact, the coefficients in front of the various powers of $(x-a)$ must be the same, i.e., for all $k$, $b_k = c_k$. 


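Example 4.6.10. Theorem 4.6.7 and Corollary 4.6.8 tell us that we could have obtained the Maclaurin series for $\cos x$ by differentiating, term by term, the Maclaurin series for $\sin x$. 

We have 

$$\sin x = \sum_{k=0}^{\infty} (-1)^k \frac{x^{2k+1}}{(2k+1)!} = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \frac{x^7}{7!} + \cdots,$$

and so 

$$\cos x = \sum_{k=0}^{\infty} (-1)^k (2k+1) \frac{x^{2k}}{(2k+1)!} = \sum_{k=0}^{\infty} (-1)^k \frac{x^{2k}}{(2k)!} = 1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \cdots,$$

which, of course, is what we knew we’d get. 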


However, we can also use differentiation to obtain equalities that we didn’t know before. For instance, we know that, for $-1 < x < 1$, 

$$\frac{1}{1-x} = \sum_{k=0}^{\infty} x^k = 1 + x + x^2 + x^3 + \cdots.$$

Taking derivatives, we obtain that, for $-1 < x < 1$, 

$$\frac{1}{(1-x)^2} = \sum_{k=1}^{\infty} k x^{k-1} = 1 + 2x + 3x^2 + 4x^3 + \cdots.$$
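As a quick numerical sanity check of the differentiated geometric series, the following Python sketch compares a partial sum of $\sum k x^{k-1}$ with $1/(1-x)^2$ at a sample point inside $(-1, 1)$. The function name, the sample point, and the number of terms are illustrative choices, not part of the text.

```python
def derivative_series_partial_sum(x, n_terms):
    """Partial sum of k * x**(k-1) for k = 1, ..., n_terms."""
    return sum(k * x ** (k - 1) for k in range(1, n_terms + 1))

x = 0.3                                         # sample point with |x| < 1
approx = derivative_series_partial_sum(x, 60)   # 60 terms is an arbitrary choice
exact = 1.0 / (1.0 - x) ** 2
print(approx, exact)
```

Moving the sample point closer to $\pm 1$ requires many more terms for the same accuracy, which is consistent with the radius of convergence being 1.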
Remark 4.6.11. This is a good time to remind you of an earlier remark. Suppose that $f$ is infinitely differentiable and defined at 0. If $f$ is an even function, then only even-powered terms appear in the Maclaurin series of $f$. If $f$ is an odd function, then only odd-powered terms appear in the Maclaurin series of $f$. This is true whether or not $f$ is actually equal to its Maclaurin series on an open interval containing 0. The proof of this was outlined for you in Exercise 37 of an earlier section. 

You can make a stronger statement for power series functions. A power series function $p(x)$, centered at 0, with a non-zero radius of convergence, is even (respectively, odd) if and only if the only terms which appear (with non-zero coefficients) in $p(x)$ are the terms whose power is even (respectively, odd). You are asked to show this in Exercise 39. 

One slightly subtle point is that, for a function to be even or odd, for every $x$ in the domain, it is required that $-x$ is also in the domain. For a power series function, centered at 0, this means that if one endpoint of the interval of convergence $I$ is included in $I$, then the other endpoint must also be included in $I$. That is, for power series, centered at 0, containing only even, or only odd terms, the interval of convergence is either of the form $(-R, R)$ or $[-R, R]$. 


Example 4.6.12. Given a fixed center, the fact that power series representations of functions are unique, and must be the Taylor series, can save you a lot of work. 

Consider the function $f(x) = e^{(x^2)} = e^{x^2}$. 

Suppose that you want to calculate the Maclaurin series of $f$ barehandedly, i.e., from the definition of Maclaurin/Taylor series. To do this, you need $f^{(k)}(0)$, for all $k$, but the higher-order derivatives of $f$ get pretty ugly pretty fast. 

$$f'(x) = 2x e^{x^2},$$

$$f''(x) = 2\left(x \cdot 2x e^{x^2} + e^{x^2}\right) = 2(2x^2 + 1)e^{x^2},$$

$$f'''(x) = 2\left[(2x^2 + 1)\,2x e^{x^2} + 4x e^{x^2}\right] = 4x(2x^2 + 3)e^{x^2},$$

and so on. 

However, Corollary 4.6.8 tells us that, if we can produce by any means an equality between $e^{x^2}$ and a power series, centered at 0, on an open interval, then that series must be the Maclaurin series of $e^{x^2}$. 
That certainly makes things easier. By substituting $x^2$ in for $x$ in the Maclaurin series equality for $e^x$, Theorem 4.4.5, we obtain, for all $x$, 

$$e^{x^2} = \sum_{k=0}^{\infty} \frac{(x^2)^k}{k!} = 1 + x^2 + \frac{x^4}{2!} + \frac{x^6}{3!} + \frac{x^8}{4!} + \cdots.$$

Hence, this series must be the Maclaurin series for $e^{x^2}$. Nice, huh? 
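To see the payoff of uniqueness concretely, here is a small Python sketch (the function names and the sample point are hypothetical). It reads the derivatives $f^{(n)}(0)$ of $e^{x^2}$ off the substituted series as $n!$ times the coefficient of $x^n$, and compares a partial sum with the built-in exponential.

```python
import math

def maclaurin_coefficient(n):
    """Coefficient of x**n in the series obtained by substituting x**2 into e**x."""
    # Only even powers appear: x**(2k) has coefficient 1/k!.
    if n % 2 == 1:
        return 0.0
    return 1.0 / math.factorial(n // 2)

def derivative_at_zero(n):
    """n-th derivative of e**(x**2) at 0, read off as n! times the coefficient."""
    return math.factorial(n) * maclaurin_coefficient(n)

def partial_sum(x, highest_power):
    """Sum of the series terms up to and including x**highest_power."""
    return sum(maclaurin_coefficient(n) * x ** n for n in range(highest_power + 1))

x = 0.5
approx = partial_sum(x, 8)   # 1 + x^2 + x^4/2! + x^6/3! + x^8/4!
exact = math.exp(x * x)
print(derivative_at_zero(2), derivative_at_zero(3), approx, exact)
```

The values for $n = 2$ and $n = 3$ agree with plugging $x = 0$ into the formulas for $f''(x)$ and $f'''(x)$ computed by hand in the example, namely 2 and 0.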


Using Theorem 4.6.7 in reverse tells us how to anti-differentiate a power series function $p(x)$, provided that we know that the series $P(x)$, obtained from term-by-term anti-differentiation of the series for $p(x)$, actually converges. Once we know how to anti-differentiate, then, since power series functions are continuous (by an earlier theorem), we can apply the Fundamental Theorem, Theorem 2.4.10, and calculate the definite integral between two points $x_0$ and $x_1$ in the interval of convergence of $p(x)$ by evaluating $P(x_1) - P(x_0)$. 


Theorem 4.6.13. (Integrating Power Series) Let $R$ be the radius of convergence of $p(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$ and let $I$ denote the interval of convergence. 

Then, for every constant $C$, the radius of convergence of the power series $P(x) = C + \sum_{k=0}^{\infty} \frac{c_k}{k+1} (x-a)^{k+1}$, obtained from $p(x)$ by term-by-term anti-differentiation, is also $R$, and, for all $x$ in the open interval $(a-R, a+R)$, 

$$\int p(x)\,dx = C + \sum_{k=0}^{\infty} \frac{c_k}{k+1} (x-a)^{k+1}.$$

If $p(x)$ converges at one or both of the endpoints of $I$, then so does $P(x)$; hence, $P(x)$ exists and is continuous on $I$, making $P(x)$ an anti-derivative of $p(x)$ on $I$ (Definition 2.4.5). However, divergence of $p(x)$ at an endpoint of $I$ does not necessarily imply divergence of $P(x)$ at that endpoint. 


Proof. See Theorem 4.5.8 of [4]. 
Remark 4.6.14. Note that, while differentiation may destroy convergence at the endpoints of the interval of convergence, integration can lead to convergence at the endpoints, even when the original series diverged at the endpoints of the interval of convergence. 


Example 4.6.15. Let’s look again at our old friend, the basic geometric series: for $-1 < x < 1$, 

$$\frac{1}{1-x} = \sum_{k=0}^{\infty} x^k = 1 + x + x^2 + x^3 + \cdots.$$

We’ll first make a substitution; we’ll replace all of the $x$’s with $-x$’s, including the $x$ in the inequality. We obtain: 

for $-1 < -x < 1$, 

$$\frac{1}{1+x} = \sum_{k=0}^{\infty} (-x)^k = \sum_{k=0}^{\infty} (-1)^k x^k = 1 - x + x^2 - x^3 + x^4 - x^5 + \cdots.$$

Multiplying the inequality throughout by $-1$, and reversing the inequalities, we see that the inequality still amounts to $-1 < x < 1$, and so the equalities above hold on this interval. 

Now, we anti-differentiate both sides of the equality to obtain that, for $-1 < x < 1$, 

$$\ln(1+x) = C + \sum_{k=0}^{\infty} (-1)^k \frac{x^{k+1}}{k+1} = C + x - \frac{x^2}{2} + \frac{x^3}{3} - \frac{x^4}{4} + \cdots,$$

for some constant $C$. 

How do you determine $C$? Plug in the only $x$ value where the series is easy to evaluate: the center, $x = 0$. We find 

$$\ln(1+0) = C + 0 + 0 + 0 + \cdots.$$

Therefore, $C = 0$, and we have, for $-1 < x < 1$, 

$$\ln(1+x) = \sum_{k=0}^{\infty} (-1)^k \frac{x^{k+1}}{k+1} = x - \frac{x^2}{2} + \frac{x^3}{3} - \frac{x^4}{4} + \frac{x^5}{5} - \cdots.$$
This is the proof of this equality for $-1 < x < 0$, which we stated, but didn’t prove, earlier. Note that this proof does not instantly give you that the equality also holds when $x = 1$, though our Taylor error proof in Theorem 4.4.12 did give us the equality at $x = 1$. If we ignore our earlier proof of the convergence of the alternating harmonic series and, instead, use another result (like the Alternating Series Test, Theorem 5.4.17) to show that the alternating harmonic series converges, we can obtain the $x = 1$ equality by using Abel’s theorem on continuity, as we discussed in Remark 4.6.2. 

Note that integration changed divergence at $x = 1$ in the original series into convergence at $x = 1$ for the integrated series. Of course, this means that term-by-term differentiation of the Maclaurin series for $\ln(1 + x)$ destroys convergence at $x = 1$. 
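The difference between the interior of the interval and the endpoint $x = 1$ is easy to observe numerically. The sketch below (function name and term counts are arbitrary illustrative choices) sums the series for $\ln(1+x)$ at $x = 0.5$ and at $x = 1$.

```python
import math

def log1p_partial_sum(x, n_terms):
    """Partial sum of (-1)**k * x**(k+1) / (k+1) for k = 0, ..., n_terms - 1."""
    return sum((-1) ** k * x ** (k + 1) / (k + 1) for k in range(n_terms))

inside = log1p_partial_sum(0.5, 40)       # well inside (-1, 1): converges quickly
endpoint = log1p_partial_sum(1.0, 1000)   # alternating harmonic series: converges slowly

print(inside, math.log(1.5))
print(endpoint, math.log(2.0))
```

With 40 terms the value at $x = 0.5$ agrees with $\ln(1.5)$ to near machine precision, while 1000 terms at $x = 1$ still leave an error of roughly $5 \times 10^{-4}$.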


Example 4.6.16. Let’s start from one of our series equalities from the last example: 

for $-1 < x < 1$, 

$$\frac{1}{1+x} = \sum_{k=0}^{\infty} (-1)^k x^k = 1 - x + x^2 - x^3 + x^4 - x^5 + \cdots.$$

This time, we substitute $x^2$ in for $x$ to obtain: 

for $-1 < x^2 < 1$, 

$$\frac{1}{1+x^2} = \sum_{k=0}^{\infty} (-1)^k x^{2k} = 1 - x^2 + x^4 - x^6 + x^8 - x^{10} + \cdots.$$ (4.14) 

The condition that $-1 < x^2 < 1$ once again is equivalent to $-1 < x < 1$, and so we have the above equalities for all $x$ in the open interval $(-1, 1)$. 

Now, recalling that $(\tan^{-1} x)' = 1/(1 + x^2)$, let’s anti-differentiate the equality in Formula 4.14 to obtain 

$$\tan^{-1} x = C + \sum_{k=0}^{\infty} (-1)^k \frac{x^{2k+1}}{2k+1} = C + x - \frac{x^3}{3} + \frac{x^5}{5} - \frac{x^7}{7} + \frac{x^9}{9} - \frac{x^{11}}{11} + \cdots.$$

Once again, plug in the $x$ value of the center of the series to determine $C$. We find that $C = \tan^{-1}(0) = 0$, and so, we arrive at 

$$\tan^{-1} x = \sum_{k=0}^{\infty} (-1)^k \frac{x^{2k+1}}{2k+1} = x - \frac{x^3}{3} + \frac{x^5}{5} - \frac{x^7}{7} + \frac{x^9}{9} - \frac{x^{11}}{11} + \cdots,$$ (4.15) 

for $-1 < x < 1$. 

Of course, $\tan^{-1}$ is continuous, and so, using Abel’s theorem again, we can conclude that this last equality holds at $x = \pm 1$, provided that we know that the series actually converges at $x = \pm 1$. In fact, the same Alternating Series Test, Theorem 5.4.17, that tells us that the alternating harmonic series converges also tells us quickly that the series in Formula 4.15 converges at both $x = \pm 1$, and so the equality in Formula 4.15 holds for all $x$ in the closed interval $[-1, 1]$. 


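One cool “application” of this is that it implies that 

$$\frac{\pi}{4} = \tan^{-1}(1) = 1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \frac{1}{9} - \cdots.$$

Hence, 

$$\pi = 4\left(1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \frac{1}{9} - \cdots\right),$$

and we could use this to approximate $\pi$ by using partial sums to estimate the infinite sum. This is actually not a very efficient method for approximating $\pi$. For instance, we find 

$$\pi \approx 4\left(1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \frac{1}{9} - \frac{1}{11} + \frac{1}{13} - \frac{1}{15} + \frac{1}{17} - \frac{1}{19}\right) \approx 3.041839618929,$$

which is not very close to 3.141592653590. To get reasonable accuracy, you would have to use a partial sum with many, many terms. 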
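How slowly these partial sums approach $\pi$ can be checked with a few lines of Python. This is only an illustrative sketch; the function name and the choice of term counts are assumptions. The ten-term partial sum reproduces the value 3.041839618929 quoted above.

```python
import math

def arctan_series_pi(n_terms):
    """4 times the partial sum of (-1)**k / (2k + 1) for k = 0, ..., n_terms - 1."""
    return 4.0 * sum((-1) ** k / (2 * k + 1) for k in range(n_terms))

ten_terms = arctan_series_pi(10)
many_terms = arctan_series_pi(100000)

print(ten_terms)               # about 3.0418396189
print(many_terms, math.pi)     # still only accurate to about 5 decimal places
```

The error after $n$ terms is roughly $1/n$, so each additional correct decimal digit costs about ten times as many terms.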


We now need to discuss algebraic operations with power series. What we shall see is that you 
add, subtract, multiply, and divide power series as though they are never-ending polynomials. 
However, the precise manner in which you deal with multiplication and division requires some 


discussion. 
Theorem 4.6.17. Suppose that $p(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$ and $q(x) = \sum_{k=0}^{\infty} d_k (x-a)^k$ converge on an interval $I$. 

Then, for all $x$ in $I$, we have the following equality of functions: 

$$p(x) + q(x) = \sum_{k=0}^{\infty} (c_k + d_k)(x-a)^k,$$

where, in particular, we mean that the series converges for all $x$ in $I$. 

In words, we say that summation of two power series with the same center is performed “term-wise”. 

Proof. This follows at once from the analogous theorem for series of constants, Theorem 5.2.17, which is left to you as an easy exercise in Section 5.2. 


The following theorem basically says that multiplication distributes over addition, even if 
the addition is of an infinite number of things. It follows at once from distributing in the partial 


sums, and using basic limit properties. 


Theorem 4.6.18. Suppose that $p(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$ converges on an interval $I$, and that $f(x)$ is a function defined on $I$. 

Then, for all $x$ in $I$, $\sum_{k=0}^{\infty} f(x) c_k (x-a)^k$ converges, and 

$$f(x)\,p(x) = \sum_{k=0}^{\infty} f(x)\,c_k (x-a)^k.$$

In particular, if $f(x) = r(x-a)^m$, where $r$ is a constant and $m \geq 0$ is an integer, then we obtain that $r(x-a)^m p(x)$ equals the power series $\sum_{k=0}^{\infty} r c_k (x-a)^{k+m}$. 
Example 4.6.19. Of course, you can combine Theorem 4.6.17 and Theorem 4.6.18 to find power series representations of various combinations of known series. For instance, for all $x$, 

$$5\sin x + 3x\cos x = 5\sum_{k=0}^{\infty} (-1)^k \frac{x^{2k+1}}{(2k+1)!} + 3x\sum_{k=0}^{\infty} (-1)^k \frac{x^{2k}}{(2k)!} = \sum_{k=0}^{\infty} (-1)^k \left[\frac{5}{(2k+1)!} + \frac{3}{(2k)!}\right] x^{2k+1}$$

$$= 8x - \left(\frac{5}{3!} + \frac{3}{2!}\right)x^3 + \left(\frac{5}{5!} + \frac{3}{4!}\right)x^5 - \left(\frac{5}{7!} + \frac{3}{6!}\right)x^7 + \cdots.$$

Note that, as we discussed in Remark 4.6.11, the odd function $5\sin x + 3x\cos x$ has a Maclaurin series containing only odd-powered terms. 


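Example 4.6.20. Power series manipulations can help you calculate limits. 

For instance, for all $x \neq 0$, 

$$\frac{e^x - 1 - x}{x^2} = \frac{1}{x^2}\left[(-1 - x) + 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \cdots\right] = \frac{1}{x^2}\left[\frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \cdots\right] = \frac{1}{2!} + \frac{x}{3!} + \frac{x^2}{4!} + \cdots.$$

Therefore, as power series are continuous, we find that 

$$\lim_{x\to 0} \frac{e^x - 1 - x}{x^2} = \lim_{x\to 0}\left[\frac{1}{2!} + \frac{x}{3!} + \frac{x^2}{4!} + \cdots\right] = \frac{1}{2} + 0 + 0 + \cdots = \frac{1}{2}.$$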
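The limit can be sanity-checked numerically. In the Python sketch below (the names and sample points are illustrative assumptions), `quotient` evaluates the original expression near 0 and `series_value` evaluates the simplified series $1/2! + x/3! + x^2/4! + \cdots$, which is also defined at $x = 0$.

```python
import math

def quotient(x):
    """(e**x - 1 - x) / x**2, defined only for x != 0."""
    return (math.exp(x) - 1.0 - x) / x ** 2

def series_value(x, n_terms=10):
    """Partial sum of x**k / (k + 2)! for k = 0, ..., n_terms - 1."""
    return sum(x ** k / math.factorial(k + 2) for k in range(n_terms))

for x in (0.1, 0.01):
    print(x, quotient(x), series_value(x))

print(series_value(0.0))   # the constant term, 1/2
```

Evaluating `quotient` at extremely small $x$ suffers from floating-point cancellation in the numerator, which is one practical reason to prefer the series form near 0.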


Example 4.6.21. In this example, we “cheat” and use that Calculus and the theory of infinite series work perfectly well when you use complex numbers in place of real numbers. This means that, in this example, we want to allow numbers of the form $a + bi$, where $a$ and $b$ are real numbers, and $i$ is a square root of $-1$, i.e., $i^2 = -1$. 

Note that, when you look at positive integral powers of $i$, what you get repeats every four steps: 

$$i^1 = i, \quad i^2 = -1, \quad i^3 = -i, \quad i^4 = 1, \quad i^5 = i, \quad \ldots$$

Now, in the Maclaurin series for $e^x$, replace $x$ by $i\theta$, where we’re thinking of $\theta$ being a real number (but it doesn’t have to be). In fact, we’re thinking of $\theta$ as an angle (hence, the use of the variable $\theta$); we shall see why in a minute. Using what we wrote about how the powers of $i$ work, we obtain 

$$e^{i\theta} = 1 + i\theta + \frac{i^2\theta^2}{2!} + \frac{i^3\theta^3}{3!} + \frac{i^4\theta^4}{4!} + \frac{i^5\theta^5}{5!} + \frac{i^6\theta^6}{6!} + \cdots$$

$$= \left(1 - \frac{\theta^2}{2!} + \frac{\theta^4}{4!} - \frac{\theta^6}{6!} + \cdots\right) + i\left(\theta - \frac{\theta^3}{3!} + \frac{\theta^5}{5!} - \cdots\right) = \cos\theta + i\sin\theta.$$

This fundamental result, that 

$$e^{i\theta} = \cos\theta + i\sin\theta$$ (4.16) 

is of great importance in many areas of mathematics, physics, and electrical engineering; this equality is known as Euler’s Formula. 


As a quick example of the usefulness of Euler’s Formula, let’s see how easily it lets you derive the angle addition formulas for sine and cosine. We will use that, if $a$, $b$, $c$ and $d$ are real numbers, then $a + bi = c + di$ if and only if $a = c$ and $b = d$. 

Suppose that $\alpha$ and $\beta$ are real numbers. Euler’s Formula tells us that 

$$e^{i\alpha} = \cos\alpha + i\sin\alpha \quad\text{and}\quad e^{i\beta} = \cos\beta + i\sin\beta.$$

Multiplying, we obtain 

$$e^{i\alpha}\cdot e^{i\beta} = \cos\alpha\cos\beta + i^2\sin\alpha\sin\beta + i\cos\beta\sin\alpha + i\cos\alpha\sin\beta,$$

and so 

$$e^{i(\alpha+\beta)} = (\cos\alpha\cos\beta - \sin\alpha\sin\beta) + i(\cos\beta\sin\alpha + \cos\alpha\sin\beta).$$

Applying Euler’s Formula one more time to $e^{i(\alpha+\beta)}$, we find that 

$$\cos(\alpha+\beta) + i\sin(\alpha+\beta) = (\cos\alpha\cos\beta - \sin\alpha\sin\beta) + i(\cos\beta\sin\alpha + \cos\alpha\sin\beta)$$

and, therefore, matching the parts without the $i$’s (the real parts) and the parts with the $i$’s (the imaginary parts), we obtain the well-known angle addition formulas: 

$$\cos(\alpha+\beta) = \cos\alpha\cos\beta - \sin\alpha\sin\beta$$

and 

$$\sin(\alpha+\beta) = \cos\beta\sin\alpha + \cos\alpha\sin\beta.$$

So cool. 
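Python's built-in complex numbers make it easy to test Euler's Formula and the angle addition argument numerically. The sketch below is illustrative only: the helper name, the sample angles, and the number of terms are assumptions, and a numerical check is of course not a proof.

```python
import cmath
import math

def exp_i_theta_partial_sum(theta, n_terms):
    """Partial sum of (i*theta)**k / k! for k = 0, ..., n_terms - 1."""
    return sum((1j * theta) ** k / math.factorial(k) for k in range(n_terms))

theta = 0.7
series_value = exp_i_theta_partial_sum(theta, 30)
euler_value = complex(math.cos(theta), math.sin(theta))
print(series_value, euler_value)

# Multiply e^{i*alpha} by e^{i*beta} and compare real and imaginary parts
# with cos(alpha + beta) and sin(alpha + beta).
alpha, beta = 0.4, 1.1
product = cmath.exp(1j * alpha) * cmath.exp(1j * beta)
print(product.real, math.cos(alpha + beta))
print(product.imag, math.sin(alpha + beta))
```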


Example 4.6.22. Now that we have Euler’s Formula, we can also give the close relationship between hyperbolic sine and cosine, from Definition 1.4.1, and the usual, circular, sine and cosine. 

Euler’s Formula tells us that $e^{i\theta} = \cos\theta + i\sin\theta$. Replacing $\theta$ with $-\theta$, and using that cosine is an even function and sine is an odd function, we find that $e^{-i\theta} = \cos\theta - i\sin\theta$. 

Therefore, we obtain 

$$\cosh(i\theta) = \frac{e^{i\theta} + e^{-i\theta}}{2} = \frac{\cos\theta + i\sin\theta + \cos\theta - i\sin\theta}{2} = \cos\theta$$

and 

$$\sinh(i\theta) = \frac{e^{i\theta} - e^{-i\theta}}{2} = \frac{\cos\theta + i\sin\theta - \cos\theta + i\sin\theta}{2} = i\sin\theta.$$

Or, replacing $\theta$ with $-i\theta$, and using $i^2 = -1$ and the even- and odd-ness of cosine and sine, we have 

$$\cosh\theta = \cos(i\theta) \quad\text{and}\quad \sinh\theta = -i\sin(i\theta).$$


There are still three important operations with power series that we want to look at: multiplying two power series with the same center, dividing two power series with the same center, and composing power series, which, in fact, is best discussed in terms of real analytic functions (Definition 4.5.20). 


Example 4.6.23. Consider the two polynomials, centered at $a$: 

$$p(x) = b_0 + b_1(x-a) + b_2(x-a)^2 + b_3(x-a)^3 \quad\text{and}\quad q(x) = c_0 + c_1(x-a) + c_2(x-a)^2 + c_3(x-a)^3.$$

Multiplying $p(x)$ times $q(x)$, and collecting powers of $(x-a)$, you find 

$$p(x)\,q(x) = b_0c_0 + (b_0c_1 + b_1c_0)(x-a) + (b_0c_2 + b_1c_1 + b_2c_0)(x-a)^2 + (b_0c_3 + b_1c_2 + b_2c_1 + b_3c_0)(x-a)^3 + \text{higher-degree terms}.$$

You shouldn’t memorize this, but hopefully you see the pattern: the coefficient in front of the $(x-a)^n$ term in the product is a sum of the products of coefficients of the two individual power series, and the indices of the coefficients from $p(x)$ and $q(x)$ always add up to $n$, the exponent of $(x-a)^n$. It is not hard to understand why this is true: to obtain an $(x-a)^n$ term in the product, you can take any term from $p(x)$ of degree $k$, where $0 \leq k \leq n$, and multiply it by a term from $q(x)$ of degree $n-k$, i.e., take $b_k(x-a)^k \cdot c_{n-k}(x-a)^{n-k} = b_k c_{n-k}(x-a)^n$. Of course, you get a summand of this form for each $k$ such that $0 \leq k \leq n$. 

This pattern actually continues where we have written “higher-degree terms”; it’s just that many of the coefficients involved in the terms of degree $\geq 4$ are zero, since those terms are zero (i.e., don’t appear) in $p(x)$ and $q(x)$. 


The same process that we used in the above example applies to infinite series. 
Theorem 4.6.24. Suppose that $p(x) = \sum_{k=0}^{\infty} b_k (x-a)^k$ and $q(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$ converge on an open interval $I$. 

Then, for all $x$ in $I$, we have the following equality of functions: 

$$p(x)\cdot q(x) = \sum_{n=0}^{\infty} \left(\sum_{k=0}^{n} b_k c_{n-k}\right)(x-a)^n,$$

where, in particular, we mean that the series converges for all $x$ in $I$. 

Proof. This follows from Theorem 5.4.11, combined with the fact that power series converge absolutely on the interior of the interval of convergence; see Theorem 4.5.9. 


Example 4.6.25. Find the 5th order Maclaurin polynomial of $e^x \sin x$, and use it to approximate $e^{0.1}\sin(0.1)$. 

Solution: 

We could do this “simply” by calculating the 5th order Maclaurin polynomial from the definition, Definition 4.2.2. However, instead, we will use Theorem 4.6.24 and just do the multiplication out to the degree 5 term. 

We have 

$$e^x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \cdots$$

and 

$$\sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \cdots.$$

Don’t get confused by the missing terms in the $\sin x$ series; if it helps, explicitly insert zero times each of the even powers of $x$. However, you shouldn’t really have to write those terms in; just keep in mind that they’re there or, equivalently, that when a term is missing, you don’t get the corresponding product. Think: How do you get a degree 0 term in the product? You have to multiply a degree 0 term from $e^x$ times a degree 0 term from $\sin x$; but there is no degree 0 term in the $\sin x$ series, so you don’t get a degree 0 term in the product. How do you get a degree 1 term in the product? By multiplying a degree 0 term from $e^x$ times a degree 1 term from $\sin x$ (yielding $1 \cdot x$) and/or by multiplying a degree 1 term from $e^x$ times a degree 0 term from $\sin x$ (but this portion doesn’t appear since there is no degree 0 term from $\sin x$). Just continue in this manner. 
What you should find is (we’ve included the 0’s, in case you thought of it like that): 

$$e^x \sin x = 0 + (1 + 0)x + (0 + 1 + 0)x^2 + \left(-\frac{1}{3!} + 0 + \frac{1}{2!} + 0\right)x^3 + \left(0 - \frac{1}{3!} + 0 + \frac{1}{3!} + 0\right)x^4 + \left(\frac{1}{5!} + 0 - \frac{1}{2!\,3!} + 0 + \frac{1}{4!} + 0\right)x^5 + \cdots$$

$$= x + x^2 + \frac{1}{3}x^3 - \frac{1}{30}x^5 + \cdots.$$

As 0.1 is reasonably close to the center 0, we expect that a decent approximation of $e^{0.1}\sin(0.1)$ is given by 

$$0.1 + (0.1)^2 + \frac{1}{3}(0.1)^3 - \frac{1}{30}(0.1)^5 \approx 0.110333.$$

The calculator value of $e^{0.1}\sin(0.1)$ is 0.110332988730, so our approximation looks very good. 
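The bookkeeping in this multiplication is mechanical, so it is a natural thing to hand to a computer. The following Python sketch (the function and variable names are hypothetical) stores each series as a list of exact fractional coefficients and forms the coefficient of $x^n$ in the product as $\sum_{k=0}^{n} b_k c_{n-k}$, exactly as in Theorem 4.6.24.

```python
from fractions import Fraction
from math import factorial

def cauchy_product(b, c):
    """Coefficients of the product series, up to the shorter input's degree."""
    n_max = min(len(b), len(c))
    return [sum(b[k] * c[n - k] for k in range(n + 1)) for n in range(n_max)]

# Coefficients of x^0, ..., x^5 in the Maclaurin series of e^x and sin x.
exp_coeffs = [Fraction(1, factorial(k)) for k in range(6)]
sin_coeffs = [Fraction(0), Fraction(1), Fraction(0),
              Fraction(-1, 6), Fraction(0), Fraction(1, 120)]

product = cauchy_product(exp_coeffs, sin_coeffs)
print(product)   # coefficients 0, 1, 1, 1/3, 0, -1/30

# Evaluate the resulting 5th order polynomial at x = 0.1.
approx = sum(float(coeff) * 0.1 ** n for n, coeff in enumerate(product))
print(approx)
```

Using `Fraction` keeps the coefficients exact, so the output can be compared directly with the hand computation.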


Now let’s look at division. 


Example 4.6.26. Knowing the Maclaurin series for $e^x$, $\sin x$, and $\cos x$, can we determine Maclaurin series or, at least, the 4th order Maclaurin polynomials, for $e^x/\sin x$ and $e^x/\cos x$ in an algebraic manner, rather than by taking lots of fairly ugly derivatives? 

For $e^x/\sin x$, the answer is certainly no; you can’t have a Maclaurin series for this function because the function itself is undefined at $x = 0$. More generally, if you have two power series centered at $a$, $p(x)$ and $q(x)$, and you want a power series centered at $a$ which equals $p(x)/q(x)$, you at least need $q(a) \neq 0$, i.e., if $q(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$, you need $c_0 \neq 0$. 

So, what about $e^x/\cos x$? Can we long divide one series into another to find 

$$\frac{1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \cdots}{1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \cdots} = d_0 + d_1x + d_2x^2 + d_3x^3 + d_4x^4 + \cdots\ ?$$

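The answer is that, yes, we could describe long division, but it’s actually nicer to work backwards from the multiplication. That is, we’re going to solve for the coefficients $d_k$ that make the following equality hold: 

$$1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \frac{x^4}{4!} + \cdots = \left(d_0 + d_1x + d_2x^2 + d_3x^3 + d_4x^4 + \cdots\right)\left(1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \cdots\right).$$ (4.17) 

Calculating the first few terms in the product, we find 

$$\left(d_0 + d_1x + d_2x^2 + d_3x^3 + d_4x^4 + \cdots\right)\left(1 - \frac{x^2}{2!} + \frac{x^4}{4!} - \frac{x^6}{6!} + \cdots\right) = d_0 + d_1x + \left(-\frac{d_0}{2!} + d_2\right)x^2 + \left(-\frac{d_1}{2!} + d_3\right)x^3 + \left(\frac{d_0}{4!} - \frac{d_2}{2!} + d_4\right)x^4 + \cdots.$$

Now, we set this equal to the Maclaurin series for $e^x$, as in Formula 4.17, match the coefficients, and get 5 equations and 5 unknowns, but the equations are easy to solve: 

$$d_0 = 1, \quad d_1 = 1, \quad -\frac{d_0}{2!} + d_2 = \frac{1}{2!}, \quad -\frac{d_1}{2!} + d_3 = \frac{1}{3!}, \quad\text{and}\quad \frac{d_0}{4!} - \frac{d_2}{2!} + d_4 = \frac{1}{4!}.$$

Solving the equations, in order, each new equation gives you a single new unknown, because you’ve already solved for the others. Therefore, it’s easy to obtain 

$$d_0 = 1, \quad d_1 = 1, \quad d_2 = 1, \quad d_3 = \frac{2}{3}, \quad\text{and}\quad d_4 = \frac{1}{2}.$$

Thus, if $f(x) = \dfrac{e^x}{\cos x}$, then the 4th order Maclaurin polynomial of $f$ is 

$$1 + x + x^2 + \frac{2}{3}x^3 + \frac{1}{2}x^4.$$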

We should remark that the Maclaurin/Taylor series obtained by division has a non-zero radius of convergence, provided that the two series being divided do also; however, the radius of convergence of the quotient series may be strictly smaller than those of the series in the numerator and denominator. In particular, there is a problem if the denominator hits 0. But, assuming that the power series $q(x)$, centered at $a$, has a non-zero radius of convergence and that $q(a) \neq 0$, then the continuity of power series functions implies that there is an open interval $I$, containing $a$, such that, for all $x$ in $I$, $q(x) \neq 0$. 


We now state the theorem which summarizes what we saw in the example above; see [4], 


pages 269-270, for a discussion. 


Theorem 4.6.27. Suppose that $p(x) = \sum_{k=0}^{\infty} b_k (x-a)^k$, $q(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$, that both series have a positive radius of convergence, and that $c_0 \neq 0$. 

Then, there is a unique power series, $r(x) = \sum_{k=0}^{\infty} d_k (x-a)^k$, centered at $a$, which converges on an open interval $I$, containing $a$, such that, for all $x$ in $I$, $q(x) \neq 0$ and 

$$\frac{p(x)}{q(x)} = r(x).$$

The series $r(x)$ is determined by the equality $p(x) = r(x)\,q(x)$. 


As the final topic for this section, we want to discuss composition of power series functions. 


In many ways, this topic is awful to think about. 


Suppose you have two power series, $p(x) = \sum_{k=0}^{\infty} b_k (x-a)^k$ and $q(x) = \sum_{j=0}^{\infty} c_j (x-b)^j$, and that they converge on open intervals $I$ and $J$, respectively, containing their respective centers. Can you obtain a new convergent power series for $p(q(x))$ by algebraically manipulating the given two series? Well...consider what you get when you substitute the series for $q(x)$ in for the $x$ in the series for $p(x)$; you get 

$$\sum_{k=0}^{\infty} b_k \left(-a + \sum_{j=0}^{\infty} c_j (x-b)^j\right)^k.$$

So, we have an infinite sum of arbitrarily large powers of a series. Now, we could raise a series to any power by iterating how we multiplied two series; this would be truly horrible. Think about calculating a large power of a series such as $1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots$ in this manner. But, even if we were willing to do that for each integral power, then we’d need to multiply by constants and add together an infinite number of infinite series. 


Even if we were willing and able to do this, we have ignored the issue of where the new series 
should be centered. Of course, we have indicated in the notation that we’re allowing different 
centers for p(x) and q(x), but, even if we took them both to be centered at a, it’s unclear 
that a is the best place to center the composition. In fact, discussing the centers of the series 
should make you think about possible problems with the domains and codomains involved in 


the composition. 


We specified the domains of $p$ and $q$ as $I$ and $J$, respectively, and power series functions have the entire real line, $\mathbb{R}$, as their codomains by default. Thus, we are discussing composing $p : I \to \mathbb{R}$ and $q : J \to \mathbb{R}$, but this may not be possible; for $p(q(x))$ to be defined, we would need for the value $q(x)$ to be in $I$. Thus, we need to restrict the codomain of $q$ to $I$, which requires us to restrict the domain of $q$ to the set of those $x$’s such that $q(x)$ is in $I$; this set is denoted $q^{-1}(I)$. However, the set $q^{-1}(I)$ need not be an interval; it may consist of a union of non-intersecting intervals, and there’s no reason to expect that the original center of the power series $q(x)$ will be in the restricted domain given by $q^{-1}(I)$. Therefore, there is no natural center for the composition. 


Ugh — composition of power series functions is a mess! The calculations that we’d have to 
perform are practically impossible and, in the general case, it’s not even theoretically possible 
to make sense of the thing that we’re trying to calculate. 


However, recall our definition of real analytic functions from Definition [4.5.20] A large part 
of the point of a real analytic function is that, while it can locally be written as a power series, 
the center of the power series that it equals is allowed to move. This makes a statement about 
compositions of analytic functions easy, but you should understand that this easy statement, 
while being theoretically nice, does not, in any way, make it easy to actually calculate power 


series representations of compositions of real analytic, or power series, functions. 


Theorem 4.6.28. Compositions of real analytic functions are real analytic. 

More precisely, if $U$, $V$, $W$ are open subsets of real numbers, and $g : U \to V$ and $f : V \to W$ are real analytic functions, then $f \circ g : U \to W$ is a real analytic function. 

Example 4.6.29. So, for instance, Theorem 4.6.28 tells us that $\sin(e^x)$ and $e^{\cos x}$ are real analytic on the entire real line, i.e., for all real numbers $a$, these functions are equal to their Taylor series, centered at $a$, in some open interval around $a$. 

However, Theorem 4.6.28 gives you no simple way of finding power series representations of these functions. If you want the first few terms of the Maclaurin series of either of these functions, which we denote by $f(x)$, it’s probably easiest simply to use the definition of the Maclaurin series and calculate the coefficients $f^{(k)}(0)/k!$. 


4.6.1 Exercises 


In each of Exercises 1 through 12, (a) manipulate the known Maclaurin series representations/equalities of $e^x$, $\sin x$, $\cos x$, $1/(1-x)$, $\ln(1+x)$, and $(1+x)^p$ to find Taylor series, centered at the given $a$, which equal the given function $f(x)$ on some open interval containing $a$, and give the largest open interval on which you’re certain that the equality holds, and (b) use the first three non-zero terms of your Taylor series from part (a) to estimate the value of $f(x)$ at the given $x$, and compare with the answer from a calculator. In part (a), “give” the series either by writing the general series using summation notation, or by simply writing out the first 5 non-zero terms and then $+\cdots$. 


1. $f(x) = 3\cos x - 5\sin x$, $a = 0$, $x = 0.1$ 


2. $f(x) = e^{x-2} + \sin(x-2)$, $a = 2$, $x = 2.1$ 
3. f(x) = : ,a=2,7=1.9 
1—4(x — 2) 
4. f(z) = e*+1 4+ (2+2)7,a=-1,2=-12 
5. $f(x) = x\ln(1 + x^2)$, $a = 0$, $x = 0.2$ 
6. $f(x) = (x-1)\ln x$, $a = 1$, $x = 1.1$ 
7. $f(x) = \tan^{-1}(x+2)$, $a = -2$, $x = -2.1$ 
8. $f(x) = x\tan^{-1}(x+2) = (x+2)\tan^{-1}(x+2) - 2\tan^{-1}(x+2)$, $a = -2$, $x = -2.1$ 
9. f(x) =1/(1-—2)?, a=0, e = —0.1 (Hint: Let f(x) = 1/(1 — 2) and consider f”(z).) 
10. f(#) = 78+25, $a = 0$, $x = 0.1$ 
11. $f(x) = \cos(\sqrt{x-4})$, $a = 4$, $x = 4.01$ 


is. f= | aa 


dt,a=0,2#=0.1 


In each of Exercises 13 through 21, differentiate and anti-differentiate the given series, giving the result in summation notation (preceded by a big $C+$ for the anti-derivative) and writing out the first 5 (not counting the $C+$) non-zero terms of the summations. Give the center and radius of convergence of the original series, and of the derivative and anti-derivative series; note that the given series are those from Exercises 13 through 21 of Section 4.5. 


5¥(a — 1)* as 


= 5 
13. p(z) = DS = B(@-1) + gy @-1? + gp le-1) + Ble-1* + 
k=1 
14. p(z) = D> Vk (e+3)* = (@+3) + V2(e+3)" + V3B(e+3)° + V4(e43)* + 
k=1 
oo k 2 3 4 
_ , 2& ol x x x x 
ae) 2 1)"42 727 Bao + Bed 7 HD+ By 
~~ k(n — 4)* 
16. p(x) = 2 7erI 
= (2 — 4)F 
Tt. p(z) ~ > Tk+1 
k=0 
= (2 —4)* 
18. p(2) = 2 Tkt+lk 
k= 
= ay (e+ 7)" 
19. p(x) = So(-1)' 1 { au 
= ki(2 + 7)* 
20. p(x) = S¢(-1)**1 ct ) 
oe k 
21. p(z) = YS (-1)**! = 
p(x) 2| ) Tk 


In each of Exercises 21 through 26, write out the first 5 non-zero terms of the product of the two given series, using the same (common) center as the given series. 
= O(a —1\r = (=D ie=iy 
22. pa) = S12 xl ) and aa) = S74 ) . ) 

k=0 : k=0 , 

= por — k 
23. p(a) = So(-1) pat and q(x) = ss 

k=0 k=0 

okt oo 

24. p(x) = (- (Ok +1) and q(x) = se 


25. p(x) = ay and oe g 


k 
= 1 (w — 3)* ~ 1 (xz —3)* 
26. neo) = 1-9 ST and g(0) = (0) = ye 


lee) __ a)k oo —_ k 
27. p(«) = eye and q(x) = y ( 3 


28. Multiply the Maclaurin series for $\sin x$ and $\cos x$ together, and write out the first 5 non-zero terms. You should get the Maclaurin series for $a\sin(bx)$, for some constants $a$ and $b$. What are $a$ and $b$? 


In each of Exercises 27 through 32, write out the first 4 non-zero terms of $p(x)/q(x)$, using the same (common) center as the given series. 


BS okie )6 BO (4h — 7)8 
29, p(x) = and aa) = S74 2 “ p 
k=0 k=0 
oo gktl am 
30. p(x) = ey 14 and q(x) = > 
k=0 k=0 
31. p(x) = he ai and q(x) = ya 
k=0 k=0 
32. p(x) = 2)" ea and q(x) = 
k=0 k=0 
~ 1 (x — 3) = 1 (x — 3)* 
33. p(w) =1+ 2) ( = and q(x) = p(#) =1+ yo = 
oo opie k co r— k 
34. p(t) = yen and q(x) = » : am 
35. What quadratic polynomial best approximates $\cos(\sqrt{x})$, when $x$ is close to 0? 


36. What cubic polynomial best approximates $\tan^{-1}(x+5)$, when $x$ is close to $-5$? 


37. What polynomial of degree 5 best approximates $x\ln(1 + x^2)$, when $x$ is close to 0? 


38. What polynomial of degree 4 best approximates $e^{(x-1)^2}$, when $x$ is close to 1? 


39. In this exercise, you are asked to prove the statements made in Remark 4.6.11. 


a. Suppose that a power series, centered at 0, with radius of convergence $R$, has only even-powered terms or only odd-powered terms. Prove that the interval of convergence is either $(-R, R)$ or $[-R, R]$. 

b. Prove that a power series function $p(x)$, centered at 0, with a non-zero radius of convergence, is even (respectively, odd) if and only if the only terms which appear (with non-zero coefficients) in $p(x)$ are the terms whose power is even (respectively, odd). 


40. Recall Remark 1.1.23: the function $f(x) = e^{-x^2}$ has no elementary anti-derivative. However, elementary functions involve finite combinations of certain basic functions, and power series are infinite sums. 


a. Find a power series anti-derivative of $e^{-x^2}$. 

b. Use the 3rd order Maclaurin polynomial from your answer in part a) to approximate 


fo e-® de. 


The function $f(x) = \cos(x^2)$ has no elementary anti-derivative, but we can find a power series anti-derivative.
a. Find a power series anti-derivative of $\cos(x^2)$.
b. Use the 3rd order Maclaurin polynomial from your answer in part a) to approximate
Vis cos(x?) dz. 
The function $f(x) = \sin(x^2)$ has no elementary anti-derivative, but we can find a power series anti-derivative.

a. Find a power series anti-derivative of $\sin(x^2)$.
b. Use the 3rd order Maclaurin polynomial from your answer in part a) to approximate


[ sin(x?) dx. 


In this exercise, we lead you through the steps of proving the Binomial Theorem. Recall that $\binom{p}{0} = 1$ and, for $k \geq 1$, $\binom{p}{k} = p(p-1)(p-2)\cdots(p-k+1)/(k!)$.




a. Let
$$f(x) = \sum_{k=0}^{\infty} \binom{p}{k} x^k.$$
Use the Ratio Test to show that $f(x)$ converges (absolutely) for $|x| < 1$.

b. Calculate the Maclaurin series for $f'(x)$.

c. Show that, for $|x| < 1$,
$$(1+x)f'(x) = pf(x).$$

d. Solve the separable differential equation in the last part, and use the initial data that $f(0) = 1$, to show that, for $|x| < 1$, $f(x) = (1+x)^p$.


44. Square both sides of Euler's Formula, Formula 4.16: $e^{i\theta} = \cos\theta + i\sin\theta$. What well-known identities, using only real numbers, do you obtain for $\sin(2\theta)$ and $\cos(2\theta)$?


45. Cube both sides of Euler's Formula, Formula 4.16: $e^{i\theta} = \cos\theta + i\sin\theta$. What not-so-well-known identities, using only real numbers, do you obtain for $\sin(3\theta)$ and $\cos(3\theta)$?
4.7 Power Series Solutions of Differential Equations


One important application of power series functions is using them to find solutions to differential 
equations and/or initial value problems (IVP’s). We discussed differential equations and IVP’s 
at length in the final chapter of [2], before we had power series at our disposal. 


How do power series help you solve differential equations and/or IVP’s? You assume that 
there is a power series solution to the differential equation, centered at the x-coordinate of the 
initial data in the case of an IVP. You then plug in a power series with unknown coefficients, 
and this frequently leads to a manageable, but infinite, set of simultaneous equations to solve, 
or from which to derive a general formula for the coefficients. Your solution is then the power 
series that you end up with, or an approximate solution is given by using the first few non-zero 


terms. 


Sometimes, you may recognize your power series solution as being equal to some combination 
of standard functions, i.e., you may recognize it as an elementary function when you’re finished, 


but this is not something to be expected in general. 


Example 4.7.1. Let’s begin with a separable differential equation, one that we can solve 


explicitly, so that we can compare with the power series solution. 


Consider the IVP:
$$\frac{dy}{dx} = xy; \qquad y(0) = 5.$$

The differential equation is separable. We find
$$\int \frac{1}{y}\,dy = \int x\,dx.$$

Thus, $\ln|y| = x^2/2 + C$ and, after some algebra, we find $y = Ae^{x^2/2}$, for some constant $A$. Plugging in the initial data, we find that $A = 5$, so that the unique solution to the IVP is
$$y = 5e^{x^2/2}.$$


Okay. So, how does the power series approach go? 




We look for a power series solution to the IVP. Since the $x$ value of the initial data is 0, we look for a power series centered at 0. Thus, we want to find the coefficients $b_0, b_1, b_2, \dots$ so that
$$y = y(x) = \sum_{k=0}^{\infty} b_k x^k = b_0 + b_1 x + b_2 x^2 + b_3 x^3 + \cdots$$
is a solution to the IVP.

First of all, we immediately know that we need $b_0 = 5$, because we have
$$5 = y(0) = b_0 + 0 + 0 + 0 + \cdots = b_0.$$


To find the rest of the $b_k$, we take our power series expression for $y$, and the differential equation
$$\frac{dy}{dx} = xy$$
becomes an equality of power series functions, centered at 0. As power series functions are unique, by an earlier corollary and as we discussed in Remark 4.6.9, we can match coefficients on both sides of the equation, and solve for the $b_k$'s. The only slight difficulty is in re-indexing the summations, but we'll get to that in a minute.


Since
$$y(x) = \sum_{k=0}^{\infty} b_k x^k,$$
we use the Power Rule, term-by-term, and find
$$y' = \sum_{k=1}^{\infty} k b_k x^{k-1}.$$

If we want our index to be the power of $x$ (as usual), then, as an intermediate step, let $j = k-1$, so that $k = j+1$. As $k$ goes from 1 to $\infty$, $j$ will go from 0 to $\infty$, and so
$$y' = \sum_{k=1}^{\infty} k b_k x^{k-1} = \sum_{j=0}^{\infty} (j+1) b_{j+1} x^j. \qquad (4.18)$$

Now,
$$xy = x \cdot \sum_{k=0}^{\infty} b_k x^k = \sum_{k=0}^{\infty} b_k x^{k+1},$$
and, once again, if we want to index by the power of $x$, we let $j = k+1$, so that $k = j-1$, and this time we find
$$xy = \sum_{j=1}^{\infty} b_{j-1} x^j.$$

Combining this last equation with Formula 4.18, we find that our differential equation gives us
$$\sum_{j=0}^{\infty} (j+1) b_{j+1} x^j = \sum_{j=1}^{\infty} b_{j-1} x^j.$$

We would like to say, at this point, that we now match the coefficients of the powers of $x$ on both sides of the equation. However, one summation starts at $j = 0$ and the other starts at $j = 1$. But this is no real problem; we simply split off the $j = 0$ term, and write
$$b_1 + \sum_{j=1}^{\infty} (j+1) b_{j+1} x^j = \sum_{j=1}^{\infty} b_{j-1} x^j.$$


Now we can match coefficients. The constant term on the left is $b_1$ and there is no constant term on the right; hence, $b_1 = 0$. For $j \geq 1$, the coefficient of $x^j$ in the summation on the left must equal the coefficient of the $x^j$ term of the summation on the right; thus, for $j \geq 1$, we have
$$(j+1)b_{j+1} = b_{j-1} \quad \text{or, equivalently,} \quad b_{j+1} = \frac{b_{j-1}}{j+1}, \qquad (4.19)$$
and we already know that $b_0 = 5$ and $b_1 = 0$.

Equations like that in Formula 4.19 are frequently referred to as iteration equations and, when you have some of the initial $b_k$ values, hopefully, the iteration equation is enough to give you the rest.


How does this work? Well...a real proof would be by using mathematical induction once or 


twice, but it’s really not hard to see what’s going on by looking at the first few coefficients. 


Let's look at the odd-numbered coefficients first. We have $b_1 = 0$. Now, Formula 4.19, with $j = 2$, tells us that
$$b_3 = b_{2+1} = \frac{b_{2-1}}{2+1} = \frac{b_1}{3} = 0.$$

Putting $j = 4$ into the iteration formula gives us that $b_5 = b_3/5$, but we know that $b_3 = 0$. So, $b_5 = 0$. And so on. It's easy to see that the odd-numbered coefficients are all 0.
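What about the even-numbered ones? We find, from the iteration formula,
$$b_2 = \frac{b_0}{2} = \frac{5}{2},$$
$$b_4 = \frac{b_2}{4} = \frac{5}{2\cdot 4} = \frac{5}{2^2\cdot 1\cdot 2},$$
$$b_6 = \frac{b_4}{6} = \frac{5}{2\cdot 4\cdot 6} = \frac{5}{2^3\cdot 1\cdot 2\cdot 3},$$
and
$$b_8 = \frac{b_6}{8} = \frac{5}{2\cdot 4\cdot 6\cdot 8} = \frac{5}{2^4\cdot 1\cdot 2\cdot 3\cdot 4}.$$

Hopefully, you see the pattern here; letting $n$ equal half of the even $j$ value, we have, for $n \geq 0$,
$$b_{2n} = \frac{5}{2^n\, n!}.$$
Thus, our power series solution to the IVP is
$$y = 5 + \frac{5}{2}x^2 + \frac{5}{2^2\, 2!}x^4 + \frac{5}{2^3\, 3!}x^6 + \frac{5}{2^4\, 4!}x^8 + \cdots = \sum_{n=0}^{\infty} \frac{5}{2^n\, n!}\, x^{2n}.$$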


If we had to leave our answer like this we would, but it's true that, even without looking back at our separable solution, you might notice that
$$y = \sum_{n=0}^{\infty} \frac{5}{2^n\, n!}\, x^{2n} = 5\sum_{n=0}^{\infty} \frac{(x^2/2)^n}{n!} = 5e^{x^2/2},$$
as we found by separating the variables. When finding power series solutions to arbitrary differential equations, you should not expect to be able to easily (or, even with difficulty) recognize that your solution has a “closed form”, i.e., has a form which clearly shows that the solution is an elementary function.

As you can see in this example, had we started with more general initial data $y(0) = y_0$, then $y_0$ would appear every place that we have a 5, i.e., $y_0$ would equal $b_0$, and we would have found that $y = y_0 e^{x^2/2}$.
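
The iteration equation in Formula 4.19 is easy to run by machine. The following is a minimal Python sketch, not part of the original text: the function names `series_coefficients` and `partial_sum`, the number of coefficients, and the sample point $x = 0.8$ are all illustrative choices. It generates the $b_k$ from $b_0 = 5$, $b_1 = 0$ and compares a partial sum with the closed form $5e^{x^2/2}$.

```python
import math

def series_coefficients(y0, count):
    """Return b_0, ..., b_{count-1} for y' = x*y, y(0) = y0 (count >= 2)."""
    b = [0.0] * count
    b[0] = y0                      # b_0 comes from the initial data
    b[1] = 0.0                     # no constant term on the right-hand side
    for j in range(1, count - 1):
        b[j + 1] = b[j - 1] / (j + 1)   # iteration equation (4.19)
    return b

def partial_sum(b, x):
    """Evaluate the polynomial with coefficients b at x."""
    return sum(bk * x**k for k, bk in enumerate(b))

b = series_coefficients(5.0, 21)
x = 0.8                            # illustrative sample point
approx = partial_sum(b, x)
exact = 5.0 * math.exp(x**2 / 2)
print(b[:9])                       # odd-indexed entries are 0
print(approx, exact)
```

The two printed values should agree to many decimal places, which is a quick numerical sanity check on the pattern $b_{2n} = 5/(2^n n!)$.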
But what if we had started with an initial $x$ value other than 0? Suppose we had started with arbitrary initial data $y(x_0) = y_0$? In that case, you want to look for a power series solution which is now centered at $x_0$. Why? For two reasons. First, you want your power series solution to be easy to evaluate at $x_0$, and your typical power series is easy to evaluate only at the center. Second, in general, we only expect to solve initial value problems, and find a unique $y = y(x)$, for $x$ values close to the initial $x_0$; for a power series solution, this means that we want to center at $x_0$ and find some non-zero, but possibly small, radius of convergence.


What happens if we have the IVP $y' = xy$, $y(x_0) = y_0$, and we take
$$y = \sum_{k=0}^{\infty} b_k (x - x_0)^k?$$

We once again find immediately that $b_0 = y_0$. That was easy. It's also easy to find
$$y' = \sum_{k=1}^{\infty} k b_k (x - x_0)^{k-1} = \sum_{j=0}^{\infty} (j+1) b_{j+1} (x - x_0)^j.$$

However, it's slightly more difficult to write $xy$ as a power series centered at $x_0$, since, now, the $x$ that we're multiplying by is itself not centered at $x_0$. We fix this by writing $x = (x - x_0) + x_0$.

Now,
$$xy = [(x - x_0) + x_0] \sum_{k=0}^{\infty} b_k (x - x_0)^k = \sum_{k=0}^{\infty} b_k (x - x_0)^{k+1} + \sum_{k=0}^{\infty} x_0 b_k (x - x_0)^k$$
$$= \sum_{j=1}^{\infty} b_{j-1} (x - x_0)^j + \sum_{j=0}^{\infty} x_0 b_j (x - x_0)^j.$$

Thus, our differential equation becomes
$$\sum_{j=0}^{\infty} (j+1) b_{j+1} (x - x_0)^j = \sum_{j=1}^{\infty} b_{j-1} (x - x_0)^j + \sum_{j=0}^{\infty} x_0 b_j (x - x_0)^j.$$
If we separate the two $j = 0$ terms, we obtain
$$b_1 + \sum_{j=1}^{\infty} (j+1) b_{j+1} (x - x_0)^j = x_0 b_0 + \sum_{j=1}^{\infty} (b_{j-1} + x_0 b_j)(x - x_0)^j.$$
Therefore, we obtain $b_0 = y_0$ (as we said before), $b_1 = x_0 b_0 = x_0 y_0$, and, for all $j \geq 1$,
$$(j+1)b_{j+1} = b_{j-1} + x_0 b_j,$$
that is,
$$b_{j+1} = \frac{b_{j-1} + x_0 b_j}{j+1}.$$
You may now use this iteration equation to generate as many of the $b_j$'s as you wish. It is unlikely that you'll see a pattern; what you end up with should equal
$$y = y_0\, e^{-x_0^2/2}\, e^{x^2/2}.$$

It would be difficult, indeed, to look at the power series solution, centered at $x_0$, and recognize the coefficients as those coming from this product.
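
Even when no pattern is visible, the iteration equation can still be checked numerically. Here is a short Python sketch under stated assumptions: the values $x_0 = 1$, $y_0 = 3$, the evaluation point $x = 1.4$, and the helper names are hypothetical choices made only for illustration. It builds the coefficients from $b_0 = y_0$, $b_1 = x_0 y_0$, and $b_{j+1} = (b_{j-1} + x_0 b_j)/(j+1)$, then compares the partial sum with $y_0 e^{-x_0^2/2} e^{x^2/2}$.

```python
import math

def centered_coefficients(x0, y0, count):
    """Coefficients of the series for y' = x*y, y(x0) = y0, centered at x0."""
    b = [0.0] * count
    b[0] = y0
    b[1] = x0 * y0
    for j in range(1, count - 1):
        b[j + 1] = (b[j - 1] + x0 * b[j]) / (j + 1)
    return b

def evaluate(b, x0, x):
    """Evaluate sum of b_k (x - x0)^k."""
    return sum(bk * (x - x0)**k for k, bk in enumerate(b))

x0, y0 = 1.0, 3.0                  # illustrative initial data
b = centered_coefficients(x0, y0, 30)
x = 1.4                            # a point near the center
series_value = evaluate(b, x0, x)
closed_form = y0 * math.exp(-x0**2 / 2) * math.exp(x**2 / 2)
print(series_value, closed_form)
```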


Example 4.7.2. Let's look at a non-separable differential equation and IVP; one that we considered in Section 4.5 of [2].

Consider
$$y' = x - y; \qquad y(1) = -1.$$
We want to look for a power series solution, centered at the initial $x$ value, 1.

So, suppose that
$$y = \sum_{k=0}^{\infty} b_k (x-1)^k.$$

Then, we immediately find that
$$-1 = y(1) = b_0 + 0 + 0 + 0 + \cdots,$$
so that
$$b_0 = -1.$$

Differentiating, we obtain
$$y' = \sum_{k=1}^{\infty} k b_k (x-1)^{k-1} = \sum_{j=0}^{\infty} (j+1) b_{j+1} (x-1)^j,$$
and the differential equation becomes
$$\sum_{j=0}^{\infty} (j+1) b_{j+1} (x-1)^j = x - \sum_{j=0}^{\infty} b_j (x-1)^j.$$

We now rewrite $x$ as $(x-1) + 1$, in order to center it at 1, and separate the degree 0 and 1 terms from the summations, to obtain
$$b_1 + 2b_2(x-1) + \sum_{j=2}^{\infty} (j+1) b_{j+1} (x-1)^j = 1 + (x-1) - b_0 - b_1(x-1) + \sum_{j=2}^{\infty} -b_j (x-1)^j.$$
Therefore,
$$b_1 = 1 - b_0 = 2,$$
$$2b_2 = 1 - b_1, \quad \text{and so,} \quad b_2 = -\frac{1}{2},$$
and, for all $j \geq 2$,
$$(j+1)b_{j+1} = -b_j \quad \text{or, equivalently,} \quad b_{j+1} = -\frac{b_j}{j+1}.$$
Let's put off inserting that we know that $b_2 = -1/2$ for a moment, and look at the other $b_k$'s.
When $j \geq 2$, we find
$$b_3 = -\frac{b_2}{3},$$
$$b_4 = -\frac{b_3}{4} = (-1)^2\, \frac{b_2}{3\cdot 4},$$
$$b_5 = -\frac{b_4}{5} = (-1)^3\, \frac{b_2}{3\cdot 4\cdot 5},$$
and so on. The denominators look like factorials, but they're missing the multiplication by 2. However, that's easy to fix; multiply each numerator and denominator by 2. Then, inserting that $b_2 = -1/2$, you should see (but the real “proof” requires induction) that, for $k \geq 2$,
$$b_k = (-1)^{k-2}\, \frac{2b_2}{k!} = \frac{(-1)^{k-1}}{k!}.$$
Therefore, our solution is
$$y = -1 + 2(x-1) + \sum_{k=2}^{\infty} \frac{(-1)^{k-1}}{k!} (x-1)^k. \qquad (4.20)$$

We can write this in a closed form if we really want, because the summation looks like something closely related to the Maclaurin series for $e^x$, and, in fact, it is.

Replacing the $x$ with $-(x-1)$ in the equality between $e^x$ and its Maclaurin series gives us
$$e^{-(x-1)} = \sum_{k=0}^{\infty} \frac{(-(x-1))^k}{k!} = \sum_{k=0}^{\infty} \frac{(-1)^k}{k!} (x-1)^k.$$

Hence,
$$-e^{-x+1} = \sum_{k=0}^{\infty} \frac{(-1)^{k-1}}{k!} (x-1)^k = -1 + (x-1) + \sum_{k=2}^{\infty} \frac{(-1)^{k-1}}{k!} (x-1)^k.$$
Thus,
$$1 - (x-1) - e^{-x+1} = \sum_{k=2}^{\infty} \frac{(-1)^{k-1}}{k!} (x-1)^k,$$
and Formula 4.20 becomes
$$y = y(x) = -1 + 2(x-1) + 1 - (x-1) - e^{-x+1} = x - 1 - e^{-x+1}.$$


We've done enough work at this point that there's been lots of room for small mistakes. Since it's relatively easy, it might be a good idea to check that our solution to the IVP really is a solution.
We find
$$y(1) = 1 - 1 - e^{-1+1} = 0 - 1 = -1,$$
which means that the initial data condition is satisfied. Also, if $y = x - 1 - e^{-x+1}$, then $y' = 1 + e^{-x+1}$, and it is easy to verify that $y' = x - y$.


Thus, we have indeed found a solution to the given initial value problem. In fact, we found 
the unique solution and, in this particular example, uniqueness is not difficult to establish. 
However, in more general IVP’s, showing that the solution is unique is not so easy, and we will 


not go into the matter here. 


What we have shown is that the unique real analytic solution at, or near, $x = 1$, is
$$y = x - 1 - e^{-x+1}.$$
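
As a further check on Example 4.7.2, the coefficients can be generated directly from the matching conditions. The Python sketch below is an illustration only: the function name `coefficients` and the sample point $x = 1.5$ are assumptions, not something from the text. It uses $b_0 = -1$, $b_1 = 1 - b_0$, $2b_2 = 1 - b_1$, and $b_{j+1} = -b_j/(j+1)$ for $j \geq 2$, then compares the result with both the formula $b_k = (-1)^{k-1}/k!$ and the closed form $x - 1 - e^{-x+1}$.

```python
import math

def coefficients(count):
    """Coefficients b_k of the series centered at 1 for y' = x - y, y(1) = -1."""
    b = [0.0] * count
    b[0] = -1.0                    # from the initial data
    b[1] = 1.0 - b[0]              # degree 0 terms
    b[2] = (1.0 - b[1]) / 2.0      # degree 1 terms
    for j in range(2, count - 1):
        b[j + 1] = -b[j] / (j + 1) # iteration equation for j >= 2
    return b

b = coefficients(25)
x = 1.5                            # illustrative sample point
series_value = sum(bk * (x - 1.0)**k for k, bk in enumerate(b))
closed_form = x - 1.0 - math.exp(-x + 1.0)
print(series_value, closed_form)
```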


Example 4.7.3. As our final example of this section, let's look at a simple 2nd order differential equation, with no initial conditions.
Consider
$$y'' = -4y,$$
and we'll look for a power series solution centered at 0.

So, suppose that $y = \sum_{k=0}^{\infty} b_k x^k$ is a solution to our differential equation. We quickly find
$$y' = \sum_{k=1}^{\infty} k b_k x^{k-1}$$
and
$$y'' = \sum_{k=2}^{\infty} k(k-1) b_k x^{k-2} = \sum_{j=0}^{\infty} (j+2)(j+1) b_{j+2} x^j.$$

Therefore, we need
$$\sum_{j=0}^{\infty} (j+2)(j+1) b_{j+2} x^j = \sum_{j=0}^{\infty} -4b_j x^j.$$
j=0 j=0 
It follows that $y = \sum_{k=0}^{\infty} b_k x^k$ will be a solution to $y'' = -4y$ if and only if, for all $j \geq 0$,
$$(j+2)(j+1)b_{j+2} = -4b_j \quad \text{or, equivalently,} \quad b_{j+2} = \frac{-4b_j}{(j+2)(j+1)}.$$

As the iteration formula above refers to an index which is 2 bigger on the left than the only $b$ term on the right, there will be one formula for the odd-indexed coefficients and another formula for the even-indexed coefficients. In addition, $b_0$ and $b_1$ can each be anything at all.


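Looking at the even coefficients first, we find
$$b_2 = -\frac{4b_0}{2\cdot 1},$$
$$b_4 = -\frac{4b_2}{4\cdot 3} = (-1)^2\, \frac{4^2 b_0}{4\cdot 3\cdot 2\cdot 1},$$
$$b_6 = -\frac{4b_4}{6\cdot 5} = (-1)^3\, \frac{4^3 b_0}{6\cdot 5\cdot 4\cdot 3\cdot 2\cdot 1},$$
and, generally, for $n \geq 0$,
$$b_{2n} = (-1)^n\, \frac{4^n b_0}{(2n)!} = b_0 \cdot (-1)^n\, \frac{2^{2n}}{(2n)!}.$$
A similar calculation shows that, for $n \geq 0$,
$$b_{2n+1} = (-1)^n\, \frac{4^n b_1}{(2n+1)!} = b_1 \cdot (-1)^n\, \frac{2^{2n}}{(2n+1)!} = \frac{b_1}{2} \cdot (-1)^n\, \frac{2^{2n+1}}{(2n+1)!}.$$

Therefore,
$$y = \sum_{k=0}^{\infty} b_k x^k = \sum_{k \text{ even}} b_k x^k + \sum_{k \text{ odd}} b_k x^k = \sum_{n=0}^{\infty} b_{2n} x^{2n} + \sum_{n=0}^{\infty} b_{2n+1} x^{2n+1}$$
$$= b_0 \sum_{n=0}^{\infty} (-1)^n\, \frac{2^{2n} x^{2n}}{(2n)!} + \frac{b_1}{2} \sum_{n=0}^{\infty} (-1)^n\, \frac{2^{2n+1} x^{2n+1}}{(2n+1)!}$$
$$= b_0 \sum_{n=0}^{\infty} (-1)^n\, \frac{(2x)^{2n}}{(2n)!} + \frac{b_1}{2} \sum_{n=0}^{\infty} (-1)^n\, \frac{(2x)^{2n+1}}{(2n+1)!}.$$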

If you look back at the equalities between $\cos x$, $\sin x$, and their Maclaurin series, you should see that we just found that
$$y = b_0\cos(2x) + \frac{b_1}{2}\sin(2x),$$
which means that $y(0) = b_0$ and $y'(0) = b_1$, which would be determined by initial data.
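
The two-step iteration for $y'' = -4y$ can be tested the same way. In the Python sketch below, the values of $b_0$ and $b_1$, the sample point, and the function name are arbitrary illustrative assumptions; the code simply applies $b_{j+2} = -4b_j/((j+2)(j+1))$ and compares the partial sum with $b_0\cos(2x) + (b_1/2)\sin(2x)$.

```python
import math

def coefficients(b0, b1, count):
    """Coefficients of the Maclaurin series solution of y'' = -4y."""
    b = [0.0] * count
    b[0], b[1] = b0, b1            # free parameters: y(0) and y'(0)
    for j in range(count - 2):
        b[j + 2] = -4.0 * b[j] / ((j + 2) * (j + 1))
    return b

b0, b1 = 1.5, -0.7                 # arbitrary illustrative values
b = coefficients(b0, b1, 40)
x = 0.9
series_value = sum(bk * x**k for k, bk in enumerate(b))
closed_form = b0 * math.cos(2 * x) + (b1 / 2) * math.sin(2 * x)
print(series_value, closed_form)
```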


We should remark that, above, when we split our summation into the even degree parts 
and the odd degree parts, we implicitly assumed that rearranging the order of the summation 
did not affect the sum; in general, this requires that the series converges absolutely, as we shall 
discuss in Section 


Before we leave this section, we want to point out that the examples that we gave were 


actually simple ones, though they may not have seemed so simple. 


Consider, for instance, the problem of finding a power series solution to the differential 


equation 


You can look for a power series solution, centered at 0, $y = \sum_{k=0}^{\infty} b_k x^k$, but what do you do with the $y^2$ part? Well...you square your series, that is, you multiply it times itself, and use the product series. Yes, it's ugly, but it's not ridiculously difficult to find at least the first few terms of the solution.

What about solving
$$y' = (\sin x)y + x?$$
Again, what you have to do is fairly unpleasant; you let $y = \sum_{k=0}^{\infty} b_k x^k$ and multiply this by the Maclaurin series for $\sin x$ to deal with the $(\sin x)y$ part of the problem.

The point is: while solving differential equations via power series may be time-consuming and tedious, working out what you need to do, and then actually doing it, is usually very straightforward.
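
To make the “multiply the series” step concrete, here is a hedged Python sketch for $y' = (\sin x)y + x$. The initial value $y(0) = 2$, the function names, and the number of terms are assumptions chosen only for illustration. The coefficient of $x^j$ in $(\sin x)y$ is the product-series sum $\sum_{i=0}^{j} s_i b_{j-i}$, where the $s_i$ are the Maclaurin coefficients of $\sin x$, so matching coefficients gives $(j+1)b_{j+1} = \sum_{i=0}^{j} s_i b_{j-i}$, plus 1 when $j = 1$ to account for the extra $x$.

```python
import math

def sine_coefficients(count):
    """Maclaurin coefficients s_0, ..., s_{count-1} of sin x."""
    s = [0.0] * count
    for i in range(1, count, 2):
        s[i] = (-1) ** ((i - 1) // 2) / math.factorial(i)
    return s

def solution_coefficients(y0, count):
    """Series coefficients for y' = (sin x) y + x with y(0) = y0."""
    s = sine_coefficients(count)
    b = [0.0] * count
    b[0] = y0
    for j in range(count - 1):
        # coefficient of x^j in the product series (sin x) * y
        product_coeff = sum(s[i] * b[j - i] for i in range(j + 1))
        forcing = 1.0 if j == 1 else 0.0   # the "+ x" term
        b[j + 1] = (product_coeff + forcing) / (j + 1)
    return b

b = solution_coefficients(2.0, 25)     # y(0) = 2 is an assumed value
x = 0.5
y = sum(bk * x**k for k, bk in enumerate(b))
dy = sum(k * bk * x**(k - 1) for k, bk in enumerate(b) if k >= 1)
residual = dy - (math.sin(x) * y + x)  # should be very close to 0
print(b[:5], residual)
```

The residual measures how closely the truncated series satisfies the differential equation at the sample point; it is a sanity check, not a proof of convergence.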
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4.7.1 Exercises 


In each of Exercises 1 through 10, you are given an initial value problem, in which the initial data is specified at a particular value $x = x_0$. Assume that the initial value problem has a unique power series solution, $y = \sum_{k=0}^{\infty} b_k (x - x_0)^k$ (at least, for $x$ values near $x_0$). (a) Determine iterative formulas for the $b_k$. (b) Calculate $b_0$
through bg explicitly. (c) Approximate the value of your solution at x = x) + 0.1 by 
using the first 4 non-zero terms in your power series. 

1. $y' = 5y + x$; $y(0) = 2$

2. $y' = 5y + x$; $y(1) = 2$

3. $y' = 5y + e^x$; $y(0) = 2$

4. $y' = 5y + xe^x$; $y(0) = 2$


5. y’ =5y+a7%e7 +1; y(0) =2 


6. $y' = 5y + \sin x$; $y(0) = 2$


7. $y' = (x-3)^2 y + 1$; $y(3) = -1$


8. y" =4y; y(0) =1, y'(0) =0 
9. y” = 4y + 2e7; y(0) =1, y’(0) =0 
10. $y'' = (x-2)y + 1$; $y(2) = 1$, $y'(2) = 0$
In each of Exercises 11 through 18, you are given an initial value problem, with initial data specified at $x = x_0$. Find a power series solution, centered at $x_0$, and, by manipulating known Maclaurin series, rewrite your solution in a “closed” form.


11. $y' = 5y + x$; $y(0) = 2$


12. $y' = 5y + x$; $y(1) = 2$


13. $y' = (x+2)^2 y$; $y(-2) = 7$

14. $y' = 5y + e^{x-1}$; $y(1) = 2$

15. $y' = ay + bx$; $y(0) = 0$, where $a$ and $b$ are constants, $a \neq 0$.
16. $y'' = 5y'$; $y(-2) = 7$, $y'(-2) = 0$


17. $y'' = 4y$; $y(0) = 1$, $y'(0) = 0$
18. $y'' = -k^2 y$; $y(0) = 1$, $y'(0) = 0$, where $k$ is a constant.
19. Find the 3rd order Maclaurin polynomial of the solution $y = y(x)$ to the initial value problem
$$y' = y^2 + x; \qquad y(0) = 1.$$
20. Find the 3rd order Maclaurin polynomial of the solution $y = y(x)$ to the initial value problem
$$y' = y\sin x + x; \qquad y(0) = 1.$$
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Appendix 4.A Technical Matters 


Theorem 4.A.1. Suppose that $f$ and $g$ are real analytic functions on an open interval $I$, and that $f(x) = g(x)$ for all $x$ in an open interval $J$, which is contained in $I$. Then, $f$ and $g$ are equal on $I$.

Proof. By considering $f - g$, we are reduced to showing:

Suppose $f$ is real analytic on an open interval $I$, and is 0 on an open subinterval $J \subseteq I$. Then, $f$ is 0 on all of $I$.

We will use here that intervals are connected, i.e., that an interval is not the disjoint union of two non-empty open sets.

We say that the germ of $f$ is 0 at $x$ if and only if there exists an open interval around $x$ on which $f$ is everywhere equal to 0; we write $[f]_x = 0$.

By definition, $A = \{x \in I \mid [f]_x = 0\}$ is an open subset of $I$. By our assumption that $f$ is 0 on $J$, $A$ is non-empty. We claim that the set $B = \{x \in I \mid [f]_x \neq 0\}$, which is the complement of $A$ in $I$, is also open. As $I$ is connected, it will follow that $B$ is empty, i.e., that $f$ is zero on all of $I$.

Suppose that $b \in B$. We wish to show that there is an open interval $K$ around $b$ such that $K \subseteq B$. Suppose not. Then, by picking smaller and smaller open intervals around $b$, we can produce a sequence of points $a_i \in A$ such that $\lim_{i\to\infty} a_i = b$. As $f$ is real analytic at each $a_i$ and is zero on an open interval around $a_i$, all of the coefficients of the Taylor series of $f$ at each $a_i$ must be 0, i.e., for all $i$, for all $k$, $f^{(k)}(a_i) = 0$. As the derivatives are continuous, $f^{(k)}(b) = \lim_{i\to\infty} f^{(k)}(a_i) = 0$. As all of the derivatives $f^{(k)}(b)$ are 0, and $f$ is real analytic at $b$, it follows that $f$ is 0 in an open interval around $b$, i.e., $b \notin B$. This contradiction shows that $B$ would have to be open and, hence, empty.


Chapter 5 


Theorems on Sequences and 
Series 


As we have discussed in prior chapters, with various levels of formalism, a sequence of real numbers is essentially just an infinite list of real numbers, that is, for all integers $n$, greater than or equal to some initial integer $m$, we have a real number $b_n$. A sequence converges if and only if $\lim_{n\to\infty} b_n$ exists, where $n$ is allowed to take on only integer values $\geq m$. Convergence of certain kinds of sequences is a fundamental property of the real numbers, which we shall not attempt to derive.


An infinite series is what you get when you add an infinite sequence of terms; it’s a limit of 


partial sums. We will discuss many tests for deciding when such limits of partial sums exist. 


Unfortunately, while it is extremely important to distinguish between sequences and series, 
confusion between these two terms is common — perhaps stemming from the fact that both 
words start with “se” or, perhaps, because the partial sums themselves form a second sequence, 
in addition to the sequence of terms. Throughout this chapter, you need to keep in mind that, 
in an infinite series, you are trying to make sense of an infinite sum; this is the limit of the 


partial sums, not the limit of the individual summands (the terms). 
5.1 Theorems on Sequences


In our discussion and use of power series, and several other times throughout this textbook, we 


have used infinite sequences, “lists” of real numbers that never stop. In this section, we will 


state, prove, and look at examples of some fundamental definitions and results on sequences. 


Informally, a sequence of real numbers is a list of real numbers, the terms, such that there’s a 
first number, a second number, a third number, and so on — in general, there’s an n-th number 


in the list, for every natural number n. 


For instance, we sometimes list a few numbers in the list, with some obvious pattern, and then, once the pattern is clear, we indicate the remainder of the list with dots, e.g.,
$$1, \ \frac{1}{2}, \ \frac{1}{3}, \ \frac{1}{4}, \ \dots,$$
where the $n$-th term is $b_n = 1/n$.


It is convenient in some cases to start the subscript, the index, at some number other than 1; for instance, we might consider the sequence given by
$$a_n = \frac{1}{(n-1)(n-2)} \quad \text{for } n \geq 3.$$

This causes some mild confusion when speaking of the terms of the sequence, because it's then true that the first term of this sequence is $a_3$, the second term is $a_4$, and so on. The problem is: if you said “the $n$-th term”, when using $n$ for the indexing variable, it might be unclear whether you meant $a_n$ or $a_{n+2}$. For this reason, it's best to talk about $a_n$, which is very clear, rather than to use common English phrasing. Typically, we care about the limit of a sequence, as the index approaches $\infty$, so confusing $a_n$ and $a_{n+2}$ is unlikely to cause a problem.


The notion of a sequence as a list is informal. Really, a sequence is a function; you pick an integer (greater than or equal to some starting integer $m$), and you get back a real number. We gave the rigorous definitions of a sequence and its convergence/divergence in [2], and in Definition 4.5.1; however, we will give those definitions again here.
Definition 5.1.1. Suppose that $m$ is an integer, i.e., is in the set $\mathbb{Z}$. Denote by $\mathbb{Z}_{\geq m}$ the set of integers which are greater than or equal to $m$.

A function $b: \mathbb{Z}_{\geq m} \to \mathbb{R}$ is called a sequence (of real numbers). In place of $b(n)$, it is standard to write $b_n$.

We say that the sequence $b_n$ converges to (a real number) $L$, and write $\lim_{n\to\infty} b_n = L$, if and only if, for all $\epsilon > 0$, there exists an integer $N \geq m$ such that, for all integers $n \geq N$, $|b_n - L| < \epsilon$.

If a sequence does not converge to any real number $L$, then we say that the sequence diverges.

The sequence $b_n$ diverges to $\infty$, and we write $\lim_{n\to\infty} b_n = \infty$, if and only if, for all real numbers $A$, there exists $N \geq m$ such that, for all $n \geq N$, $b_n > A$.

The sequence $b_n$ diverges to $-\infty$, and we write $\lim_{n\to\infty} b_n = -\infty$, if and only if, for all real numbers $A$, there exists $N \geq m$ such that, for all $n \geq N$, $b_n < A$.
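
The “for all $\epsilon$, there exists $N$” structure of the definition can be made concrete for the sequence $b_n = 1/n$, which converges to 0. The Python sketch below is only an illustration, and the function name `cutoff` is hypothetical: given $\epsilon$, it produces an $N$ such that $|1/n - 0| < \epsilon$ for all $n \geq N$, using the fact that $1/n < \epsilon$ exactly when $n > 1/\epsilon$. The loop then spot-checks a finite range of $n$, which illustrates the definition but, of course, does not prove it.

```python
import math

def cutoff(epsilon):
    """Return an integer N >= 1 with 1/n < epsilon for every integer n >= N."""
    return math.floor(1.0 / epsilon) + 1

for epsilon in (0.5, 0.1, 0.003):
    N = cutoff(epsilon)
    # Spot-check the defining inequality |b_n - L| < epsilon with L = 0.
    ok = all(abs(1.0 / n - 0.0) < epsilon for n in range(N, N + 10000))
    print(epsilon, N, ok)
```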


It is important that a sequence can have only one limit. 


Theorem 5.1.2. If a sequence $b_n$ converges to $L_1$ and converges to $L_2$, then $L_1 = L_2$, i.e., if a sequence converges to some limit, then that limit is the unique limit.

Proof. This proof is identical to the proof of uniqueness of limits of functions of a real variable, except that you allow only integers for the value of the variable. We refer you to the proof in [2].


Remark 5.1.3. Because the same notation is used for limits over the integers and limits over 


the real numbers, some confusion can arise. 


Think about $\lim_{n\to\infty} \sin(n\pi)$. Do we mean the limit of the sequence $a_n = \sin(n\pi)$, so that $n$ takes on only integer values (with an unspecified starting value)? Or do we mean that $n$ can be any real number? It makes a difference.

If we mean that $n$ varies over the real numbers, then $\lim_{n\to\infty} \sin(n\pi)$ does not exist, because the sine function continues to oscillate between $-1$ and 1 as $n$ increases through the real numbers.

On the other hand, if $n$ is an integer, then $\sin(n\pi) = 0$, and so the limit $\lim_{n\to\infty} \sin(n\pi)$ of the sequence $\sin(n\pi)$ is just $\lim_{n\to\infty} 0 = 0$.
How do we avoid this confusion? There are several ways. One is to explicitly state that we are taking the limit of a sequence. Another is to refer to $n$'s that take on only integer values as discrete variables, and $n$'s that can take on arbitrary real values as continuous variables. Finally, it is common to use the letters $i$, $j$, $k$, $l$, $m$, and $n$ as discrete variables, and the letters $r$, $s$, $t$, $u$, $v$, $w$, $x$, $y$, and $z$ as continuous variables. One of these ways should always make it clear which type of limit is intended.

For instance, many people would write simply
$$\lim_{n\to\infty} \sin(n\pi) = \lim_{n\to\infty} 0 = 0,$$
while
$$\lim_{x\to\infty} \sin(x\pi) \text{ does not exist},$$
automatically assuming that $n$ denotes a discrete variable and $x$ denotes a continuous variable.


In the above remark, we saw that, if we take the function $f(x) = \sin(x\pi)$, whose domain is the set of real numbers, then $\lim_{x\to\infty} f(x)$ does not exist, but if we restrict the domain of $f$ to the integers, then $\lim_{n\to\infty} f(n)$ does exist. However, the theorem below tells us that, if $\lim_{x\to\infty} f(x)$ exists, and equals $L$, then $\lim_{n\to\infty} f(n)$ also exists and equals $L$.

Theorem 5.1.4. Suppose that $M$ is a real number and that $f(x)$ is defined for all real $x \geq M$.

If $\lim_{x\to\infty} f(x)$ exists as an extended real number, and equals $L$, then the limit $\lim_{n\to\infty} f(n)$ of the restriction of $f$ to the integers $\geq M$ exists as an extended real number and equals $L$.

Proof. We shall prove the case in which $L$ is a real number, and leave the cases where $L = \pm\infty$ as an exercise.

Suppose that $\lim_{x\to\infty} f(x)$ equals a real number $L$, and let $\epsilon > 0$. Then, there exists a real number $N \geq M$ such that, if $x$ is a real number and $x \geq N$, then $|f(x) - L| < \epsilon$. However, this is true for all real numbers $\geq N$; in particular, it's true for the integers $\geq N$.

This is precisely what it means for the sequence $f(n)$ to converge to $L$.
Example 5.1.5. Theorem 5.1.4 is very useful when trying to calculate limits of sequences which arise from restricting functions of a continuous variable.

An easy example is to consider $g(x) = 1/x$. Then, we know that $\lim_{x\to\infty} g(x) = 0$. It follows that
$$\lim_{n\to\infty} \frac{1}{n} = 0,$$
where, in this latter limit, we mean the limit of the sequence.


Example 5.1.6. Let's look at something a bit more complicated. Consider the limit of the sequence $n/e^n$.

We'd like to use l'Hôpital's Rule (see [2]), but l'Hôpital's Rule requires us to take derivatives of the numerator and denominator. If $n$ really takes on only integer values, then we can't take derivatives.

However, consider, instead, the function of a continuous variable $f(x) = x/e^x$. Then, l'Hôpital's Rule tells us
$$\lim_{x\to\infty} \frac{x}{e^x} = \lim_{x\to\infty} \frac{1}{e^x} = 0,$$
and now Theorem 5.1.4 tells us that, because the “continuous limit” above is 0, we also have the discrete limit equaling the same thing, i.e.,
$$\lim_{n\to\infty} \frac{n}{e^n} = 0.$$

Understanding that the existence of the continuous variable limit implies the existence (and equality) of the discrete variable limit, it is fairly standard to be a bit sloppy, and never write anything explicit about the continuous variable function, but rather simply apply l'Hôpital's Rule to the function containing the discrete variable $n$. That is, it's pretty standard to write
$$\lim_{n\to\infty} \frac{n}{e^n} = \lim_{n\to\infty} \frac{1}{e^n} = 0.$$

Just remember: there's really a theorem at work here.
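
A few numerical values make the conclusion of this example plausible. The short Python sketch below is purely illustrative, with the sample indices chosen arbitrarily: it tabulates the sequence $n/e^n$ and shows the terms shrinking toward 0. A table like this suggests the limit but does not replace the argument via l'Hôpital's Rule and Theorem 5.1.4.

```python
import math

def term(n):
    """The n-th term of the sequence n / e^n."""
    return n / math.exp(n)

indices = [1, 2, 5, 10, 20, 50]    # arbitrary sample indices
values = [term(n) for n in indices]
for n, value in zip(indices, values):
    print(n, value)
```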
Many of the results that we proved for limits $\lim_{x\to\infty} f(x)$ have analogs for sequences, which can be proved simply by replacing the continuous variable $x$ by the discrete variable $n$; such results include arithmetic with limits, the Pinching Theorem, and composition with continuous functions, all of which we state below.


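Theorem 5.1.7. (Arithmetic of Convergent Sequences) Suppose that the sequences $a_n$ and $b_n$ converge, and that $c$ is a constant. Then,

1. $\lim_{n\to\infty} c = c$;

2. $\lim_{n\to\infty} (a_n + b_n) = \lim_{n\to\infty} a_n + \lim_{n\to\infty} b_n$;

3. $\lim_{n\to\infty} c\,a_n = c \cdot \lim_{n\to\infty} a_n$;

4. $\lim_{n\to\infty} (a_n \cdot b_n) = \lim_{n\to\infty} a_n \cdot \lim_{n\to\infty} b_n$;

5. $\lim_{n\to\infty} \dfrac{a_n}{b_n} = \dfrac{\lim_{n\to\infty} a_n}{\lim_{n\to\infty} b_n}$, provided that $\lim_{n\to\infty} b_n \neq 0$.

In particular, the sequences $a_n + b_n$, $c\,a_n$, $a_n \cdot b_n$, and (when $\lim_{n\to\infty} b_n \neq 0$) $a_n/b_n$ converge.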


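Example 5.1.8. We know that
$$\lim_{n\to\infty} \frac{1}{n^2} = 0 \quad \text{and} \quad \lim_{n\to\infty} \frac{1}{n^3} = 0.$$

Combining these with Properties 1-4 in Theorem 5.1.7 tells us that
$$\lim_{n\to\infty} \left[\left(5 - \frac{7}{n^2}\right)\left(3 + \frac{4}{n^3}\right)\right] = (5 - 7\cdot 0)(3 + 4\cdot 0) = 15.$$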
Just like functions of a continuous variable, sequences satisfy a Pinching Theorem.

Theorem 5.1.9. (Pinching Theorem for Sequences) Suppose that, for $n \geq m$, we have the inequality of sequences $a_n \leq b_n \leq c_n$, and that $\lim_{n\to\infty} a_n = \lim_{n\to\infty} c_n = L$, where $L$ is an extended real number. Then, $\lim_{n\to\infty} b_n = L$.


Example 5.1.10. Consider 


As $-|a_n| \leq a_n \leq |a_n|$, the following is an immediate corollary of the Pinching Theorem, which is useful in special cases.

Corollary 5.1.11. $\lim_{n\to\infty} |a_n| = 0$ if and only if $\lim_{n\to\infty} a_n = 0$.


Example 5.1.12. Since we know that $1/n \to 0$ as $n \to \infty$, it follows from the corollary above that
$$\lim_{n\to\infty} \frac{(-1)^n}{n} = 0.$$
You can compose continuous functions with convergent sequences (remember: sequences are technically functions) to produce new convergent sequences.

Theorem 5.1.13. Suppose that $\lim_{n\to\infty} a_n = L$, where $L$ is a real number. Suppose also that $f(x)$ is a function whose domain contains all of the values $a_n$, and that $f$ is continuous at $L$. Then, the new sequence $f(a_n)$ converges to $f(L)$, i.e.,
$$\lim_{n\to\infty} f(a_n) = f\left(\lim_{n\to\infty} a_n\right).$$


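Example 5.1.14. For instance, since $\lim_{n\to\infty} (1/n) = 0$,
$$\lim_{n\to\infty} \cos\left(\frac{1}{n}\right) = \cos(0) = 1.$$

As another example of Theorem 5.1.13, consider the limit from Example 5.1.8:
$$\lim_{n\to\infty} \left[\left(5 - \frac{7}{n^2}\right)\left(3 + \frac{4}{n^3}\right)\right] = (5 - 7\cdot 0)(3 + 4\cdot 0).$$

Instead of appealing to the arithmetic properties of limits of sequences, as we did in Example 5.1.8, we could let $f(x)$ be the continuous function $(5 - 7x^2)(3 + 4x^3)$, and then use Theorem 5.1.13 to obtain
$$\lim_{n\to\infty} f\left(\frac{1}{n}\right) = f\left(\lim_{n\to\infty} \frac{1}{n}\right) = f(0) = 15.$$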

Before we can give any useful ways of determining the convergence/divergence of a sequence, 
we must discuss two fundamental properties which essentially define the real numbers. We refer 


you to [3] for more details. 




First, we need two definitions. 


Definition 5.1.15. Suppose that $E$ is a set of real numbers, i.e., a subset of $\mathbb{R}$. Suppose that $M$ is a real number (which may or may not be in $E$).

Then, $M$ is an upper bound of $E$ if and only if, for all $x$ in $E$, $x \leq M$. If an upper bound exists for $E$, then we say that $E$ is bounded above.

An upper bound $M$ of $E$ is the least upper bound or supremum if and only if $M$ is the smallest upper bound of $E$, i.e., if $N < M$, then $N$ is not an upper bound of $E$, i.e., there exists $x$ in $E$ such that $N < x$.

$M$ is a lower bound of $E$ if and only if, for all $x$ in $E$, $M \leq x$. If a lower bound exists for $E$, then we say that $E$ is bounded below.

A lower bound $M$ of $E$ is the greatest lower bound or infimum if and only if $M$ is the biggest lower bound of $E$, i.e., if $M < N$, then $N$ is not a lower bound of $E$, i.e., there exists $x$ in $E$ such that $x < N$.

As we defined earlier in Definition 2.3.4, the set $E$ is bounded if and only if $E$ is bounded above and below.


Remark 5.1.16. When we apply any of the terms in Definition 5.1.15 to a sequence, such as saying “the sequence $a_n$ is bounded above”, we mean that the set $E$ under consideration is the set $\{a_n \mid n \geq m\}$ of values of the sequence, i.e., the range of the sequence function.


We also make the following, seemingly unrelated, definition. 


Definition 5.1.17. Suppose that $b_n$, for $n \ge m$, is a sequence of real numbers. Then, the
sequence $b_n$ is a Cauchy sequence if and only if, for all $\epsilon > 0$, there exists an integer
$M \ge m$ such that, for all $j, k \ge M$, $|b_j - b_k| < \epsilon$.
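As a rough numerical illustration of the Cauchy condition, the following Python sketch measures how far apart the terms $b_j$ and $b_k$ can be once $j, k \ge M$. The sequence $b_n = 1/n$ and the finite cutoff `N` are assumptions made here for illustration; a finite check like this can suggest, but never prove, that a sequence is Cauchy.

```python
def cauchy_spread(b, M, N):
    """Largest |b(j) - b(k)| over M <= j, k <= N.

    This is only a finite stand-in for "all j, k >= M" in the definition.
    """
    values = [b(n) for n in range(M, N + 1)]
    # The largest pairwise distance in a finite set is max - min.
    return max(values) - min(values)


def b(n):
    # Hypothetical example sequence b_n = 1/n, for n >= 1.
    return 1.0 / n


# The spread shrinks as the starting index M grows.
spreads = [cauchy_spread(b, M, 10_000) for M in (10, 100, 1000)]
print(spreads)
```

For a given $\epsilon$, one would look for an $M$ whose spread is below $\epsilon$; here the spread beyond $M$ is less than $1/M$.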


Cauchy sequences, least upper bounds, and greatest lower bounds are related in that they all provide ways of defining what's known as the completeness of the real numbers:

Theorem 5.1.18. Least Upper Bound Property & Completeness of the Real Numbers:


1. Every non-empty set of real numbers, which is bounded above, possesses a least upper 
bound. 


2. Every non-empty set of real numbers, which is bounded below, possesses a greatest 
lower bound. 


3. A sequence of real numbers converges if and only if it is a Cauchy sequence. 


Remark 5.1.19. The greatest lower bound property follows very easily from the least upper 
bound property by negating everything, which interchanges upper bounds and lower bounds. 


All of the completeness properties in Theorem 5.1.18 fail if you look solely at rational numbers. There are non-empty sets of rational numbers, which are bounded above/below by rational numbers, which have no rational supremum/infimum, and there are Cauchy sequences of rational numbers which do not converge to a rational number. Thus, any of the properties in Theorem 5.1.18 can be used to show why the rational numbers are “incomplete”.

In fact, the completeness properties in Theorem 5.1.18 actually tell us what “extra stuff” we want to combine with the rational numbers in order to obtain the real numbers. The real numbers are constructed from the rational numbers by, basically, letting real numbers be suprema/infima of bounded-above/below sets of rational numbers, or by letting real numbers be limits of Cauchy sequences of rational numbers.


We should remark that the nature of our completeness properties is why we can frequently tell that a sequence converges to something, without being able to say what the something is. The limit of the sequence may be a little mysterious when it's described as the least upper bound, or greatest lower bound, of some set of real numbers, or as being the thing that a Cauchy sequence is converging to.


Monotonic sequences will be especially important to us. These sequences are just monotonic functions which have their domains restricted to being sets of integers.

Definition 5.1.20. Suppose that $b_n$, for $n \ge m$, is a sequence of real numbers.

• $b_n$ is increasing if and only if, for all integers $k$ and $l$ such that $k > l \ge m$, $b_k \ge b_l$.

• $b_n$ is decreasing if and only if, for all integers $k$ and $l$ such that $k > l \ge m$, $b_k \le b_l$.

• $b_n$ is monotonic if and only if it is increasing or decreasing.


Remark 5.1.21. Note that increasing sequences are automatically bounded below by the initial term of the sequence, and decreasing sequences are automatically bounded above by the initial term of the sequence.


Therefore, the assumption that an increasing sequence is bounded is equivalent to the as- 
sumption that it’s bounded above, and the assumption that a decreasing sequence is bounded 
is equivalent to the assumption that it’s bounded below. 


When you're interested in whether or not a sequence converges, you can ignore any finite number of initial terms in the sequence. Hence, the following terminology is frequently convenient to use.

Definition 5.1.22. We say that a sequence $a_n$, for $n \ge m$, eventually has a certain property if there exists an integer $M \ge m$ such that the new sequence $a_n$, for $n \ge M$, has the property.

Thus, for instance, a sequence is eventually increasing if it is always increasing after some point.


The following theorem gives one of the most basic ways in which one concludes that a sequence converges.

Theorem 5.1.23. Suppose that a sequence $b_n$, for $n \ge m$, is eventually monotone. Then, $b_n$ converges if and only if it is bounded.


More specifically,

1. if $b_n$ is increasing for $n \ge m$,
   a. if $\lim_{n\to\infty} b_n = L$, then, for all $n \ge m$, $b_m \le b_n \le L$, and
   b. if, for all $n \ge m$, $b_n \le B$, then $\lim_{n\to\infty} b_n$ exists and is $\le B$.

2. if $b_n$ is decreasing for $n \ge m$,
   a. if $\lim_{n\to\infty} b_n = L$, then, for all $n \ge m$, $L \le b_n \le b_m$, and
   b. if, for all $n \ge m$, $B \le b_n$, then $\lim_{n\to\infty} b_n$ exists and is $\ge B$.

In particular, the only way for an eventually increasing sequence to diverge is to diverge to $\infty$, and the only way for an eventually decreasing sequence to diverge is to diverge to $-\infty$. This means that, for every eventually monotone sequence $b_n$, we can write $\lim_{n\to\infty} b_n = L$, where $L$ is an extended real number, i.e., a real number or $\pm\infty$.


Proof. Assume that the sequence is increasing for all $n \ge M$. The decreasing case is completely analogous.

Suppose that $b_n$ converges to $L$. We will show that, for all $n \ge M$, $b_n \le L$. Since the sequence is increasing for $n \ge M$, this will show that the sequence $b_n$ is bounded below by the minimum value in the set $\{b_n \mid m \le n \le M\}$ and above by the maximum of $L$ and the maximum value in the set $\{b_n \mid m \le n \le M-1\}$.

Suppose, to the contrary, that there exists $n_0 \ge M$ such that $b_{n_0} > L$. Let $\epsilon = b_{n_0} - L > 0$. As $b_n$ is increasing for $n \ge M$, for all $n \ge n_0$,

$$b_n - L \ \ge \ b_{n_0} - L \ \ge \ \epsilon.$$

This contradicts that $L$ is the limit of the sequence, and this contradiction establishes that the sequence $b_n$ is bounded.


We need now to show the other direction in the proof. So, suppose that the sequence $b_n$, $n \ge m$, is bounded. Then, the set $E = \{b_n \mid n \ge M\}$ is bounded; let $B$ be an upper bound of $E$. By Theorem 5.1.18, $E$ has a least upper bound; call it $L$, which is necessarily $\le B$. We claim that $\lim_{n\to\infty} b_n = L$.

Note, first, that, since $L$ is an upper bound for $E$, for all $n \ge M$, $b_n \le L$, i.e., $L - b_n \ge 0$.

Now, let $\epsilon > 0$. Then, $L - \epsilon$, which is less than $L$, is not an upper bound of $E$, i.e., there exists $n_0 \ge M$ such that $b_{n_0} > L - \epsilon$. This means that $L - b_{n_0} < \epsilon$. As $b_n$ is increasing for $n \ge M$, it follows that, for all $n \ge n_0$, $L - b_n < \epsilon$. As $L - b_n \ge 0$, this establishes what we wanted: for all $n \ge n_0$, $|L - b_n| < \epsilon$.


We still need to make an argument to justify the last statement in the theorem. Suppose now only that $b_n$ is increasing for all $n \ge M$. Then, as we saw above, $b_n$ is bounded below by the minimum value in the set $\{b_n \mid m \le n \le M\}$. As we also saw above, if the sequence $b_n$ diverges, then it's unbounded; thus, it must not be bounded above. Pick some real number $A$, and let $A_0$ be the maximum of $A$ and 1 plus the maximum value in the set $\{b_n \mid m \le n \le M-1\}$. As $b_n$ is not bounded above, there exists $n_0 \ge M$ such that $b_{n_0} > A_0$. As $b_n$, for $n \ge M$, is increasing, it follows that, for all $n \ge n_0$, $b_n \ge b_{n_0} > A_0 \ge A$, which shows that $b_n$ diverges to $\infty$.


Corollary 5.1.24. (Comparison of Monotonic Sequences)

1. Suppose that, for $n \ge m$, $a_n$ and $b_n$ are increasing, and $a_n \le b_n$. Then, if $b_n$ converges, so does $a_n$ and, in this case, $\lim_{n\to\infty} a_n \le \lim_{n\to\infty} b_n$; in particular, if $a_n$ diverges, then so does $b_n$.

2. Suppose that, for $n \ge m$, $a_n$ and $b_n$ are decreasing, and $a_n \le b_n$. Then, if $a_n$ converges, so does $b_n$, and, in this case, $\lim_{n\to\infty} a_n \le \lim_{n\to\infty} b_n$; in particular, if $b_n$ diverges, then so does $a_n$.

Proof. This is immediate from Theorem 5.1.23.


Example 5.1.25. Let's look at an example which is relevant to the next section, where we look at infinite series.

For $n \ge 0$, define sequences


n 


1 “1 
an = a ae and bn = Dh 
k=0 k=0 


As we are adding something non-negative to get from one term of $a_n$ to the next, and something non-negative to get from one term of $b_n$ to the next, the sequences $a_n$ and $b_n$ are both increasing. Moreover, clearly, we have $a_n \le b_n$ (in fact, $a_n < b_n$). We also know from Theorem 4.4.10 that the sequence $b_n$ converges to 2.

It follows from Corollary 5.1.24 that the sequence $a_n$ converges to something $\le 2$, but we have no idea what that something is, though the something must also be greater than or equal to every $a_n$. In fact, the limit of the sequence $a_n$ is precisely the smallest number that is greater than or equal to all of the $a_n$'s, i.e., the least upper bound of the sequence.
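The comparison argument can be tried out numerically. The Python sketch below uses exact fractions and compares partial sums of $1/2^k$, which approach 2, with partial sums of a smaller non-negative summand. The summand $1/(2^k+1)$ is an illustrative stand-in chosen here (an assumption, not necessarily the summand used in the example).

```python
from fractions import Fraction


def partial_sums(term, n_max):
    """Return [s_0, s_1, ..., s_{n_max}], where s_n = term(0) + ... + term(n)."""
    sums, total = [], Fraction(0)
    for k in range(n_max + 1):
        total += term(k)
        sums.append(total)
    return sums


# Hypothetical smaller summand (assumption for illustration only).
a = partial_sums(lambda k: Fraction(1, 2**k + 1), 30)
# Geometric summand whose partial sums approach 2.
b = partial_sums(lambda k: Fraction(1, 2**k), 30)

for n in (0, 5, 10, 30):
    print(n, float(a[n]), float(b[n]))
```

Both lists are increasing, and every entry of `a` sits below the matching entry of `b`, so `a` is bounded above by 2 and must converge, even though the code gives no closed form for its limit.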


Consider now the sequence $1/n$, for $n \ge 1$; this is the harmonic sequence:

$$1, \ \frac{1}{2}, \ \frac{1}{3}, \ \frac{1}{4}, \ \frac{1}{5}, \ \ldots$$

Now consider the sequence

$$\frac{1}{2}, \ \frac{1}{4}, \ \frac{1}{6}, \ \frac{1}{8}, \ \ldots$$

This is the sequence $1/(2n)$, for $n \ge 1$. Notice that each of the terms in this even harmonic sequence appears in the harmonic sequence, and they appear in the same order, i.e., the even harmonic sequence is obtained by deleting terms from the harmonic sequence, without rearranging the remaining terms.

Sequences obtained in this manner are called subsequences.


Definition 5.1.26. Suppose that we have a sequence $a_n$, for $n \ge m$. A subsequence of the sequence $a_n$ is a sequence formed by taking some (or all) of the terms of $a_n$ in order. Rigorously, this means that a subsequence of $a_n$, for $n \ge m$, is a sequence $a_{n_k}$, where $n_k$, for $k \ge 1$, is a sequence of integers such that $n_1 \ge m$ and, for $i < j$, $n_i < n_j$, i.e., the indices in the subsequence are a strictly increasing sequence of some of the indices from the sequence $a_n$.


There is a theorem and a corollary about subsequences that will be of interest to us. 


Theorem 5.1.27. If $\lim_{n\to\infty} a_n = L$, where $L$ is an extended real number, then the limit of every subsequence of $a_n$ is also $L$.

In particular, if a sequence $a_n$ has two (or more) subsequences which have different limits, then $a_n$ diverges.


Proof. While we could write this in full mathematical detail, that would obscure what's really going on, which is very elementary.

Suppose that $\lim_{n\to\infty} a_n = L$. This means that you can make $a_n$ as close to $L$ as you want (or, in the cases where $L = \pm\infty$, you can make $a_n$ as big or as negative as you want) by picking the index $n$ to be big enough. But, this is true for every $a_n$ as long as the index is big enough; in particular, it would be true for the indices (which are big enough) of the $a$'s appearing in any subsequence.


Recall from Theorem 5.1.23 that a monotone sequence always approaches a limit $L$, if we allow extended real numbers $L$. Combining this with the theorem above, we immediately obtain:

Corollary 5.1.28. If a sequence $a_n$ is monotonic, and a subsequence of $a_n$ has an extended real number $L$ as its limit, then $\lim_{n\to\infty} a_n = L$.


Example 5.1.29. Consider the sequence $a_n$, for $n \ge 1$, given by $a_n = \frac{1}{n} + (-1)^n$.

If we look at the subsequence where $n$ is even, i.e., where $n = 2k$, $k \ge 1$, we find that

$$a_{2k} = \frac{1}{2k} + 1,$$

which clearly approaches 1 as $k \to \infty$.

On the other hand, if we look at the subsequence where $n$ is odd, i.e., where $n = 2k - 1$, $k \ge 1$, we find that

$$a_{2k-1} = \frac{1}{2k-1} - 1,$$

which clearly approaches $-1$ as $k \to \infty$.

As there are two subsequences of $a_n$ which approach different limits, the (entire) sequence $a_n$ diverges, by Theorem 5.1.27.
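A quick numerical look at the two subsequences can make the divergence tangible. The sketch below, in Python, simply evaluates $a_n = 1/n + (-1)^n$ along even and odd indices; the helper names are hypothetical, and the printed values are only suggestive, not a proof.

```python
def a(n):
    # a_n = 1/n + (-1)^n, for n >= 1
    return 1.0 / n + (-1) ** n


# Subsequence with even indices n = 2k and with odd indices n = 2k - 1.
even_terms = [a(2 * k) for k in (1, 10, 100, 1000)]
odd_terms = [a(2 * k - 1) for k in (1, 10, 100, 1000)]

print(even_terms)  # values drift toward 1
print(odd_terms)   # values drift toward -1
```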
5.1.1 Exercises

In each of Exercises 1 through 15, determine whether the given sequence converges or diverges. If the sequence converges, find its limit. As we are interested in what happens as $n \to \infty$, we do not specify an initial $n$ value; assume that $n$ is big enough so that all of the given terms are defined.


i) 
i. 1 
n 


5 5+3n g 
n 


n2 


“ m2+1 


gn +n? 
Qn 
2n cos (22) 
n2+1 
n 
* 10lnn 
7. $\ln(2n) - \ln(n+1)$
2n 
Vie +3 


” 
) 


1 

(7+2 

n 

10. (+; 
n 

Osu" 


11. 


12. 


13. 


14. $\tan^{-1} n$

15. $n - \sqrt{n^2+7}$


In each of Exercises 16 through 19, use the Pinching Theorem to show that the given sequence converges.

16. $2 + \frac{(-1)^n}{n}$


17. 


18. 


19. 


Ssinn + 7 
ae 
3sinn — 2cosn 
VYn+til 
4n + 3(—1)” 
n 


In each of Exercises 20-24, find two subsequences of the given sequence which converge to different limits. Conclude that each sequence diverges.


20. 


21. 


22. 


23. 


24. 


25. 


2+ (-1)” 
sin (27) eo 
2n cos (“7) 
n+1 
1 1 2 2 3 3.4 4 
2? 2° 3? 37 4? 4? 5? 5? 


a 
S&S 
oS 
wee 


4 5 6 
0, 0, 0, 4, 0, 0, 0, 0, §, 0, 0, 0, 0, 0, 8, ... 


Suppose that a sequence $a_n$, $n \ge m$, has the property that, for all $n \ge m$, $a_n < 7$.

a. Is it true that the sequence must converge to a number that's $< 7$? Prove it or give a counterexample.

b. Is it true that, if the sequence converges to $L$, then $L < 7$? Prove it or give a counterexample.

c. Is it true that, if $a_n$ is increasing, then it converges to a number that's $< 7$? Prove it or give a counterexample.


26. Suppose that a sequence $a_n$, $n \ge m$, has the property that, for all $n \ge m$, $3 < a_n < 7$.

a. Is it true that the sequence must converge to a number $L$ such that $3 < L < 7$? Prove it or give a counterexample.

b. Is it true that, if the sequence converges to $L$, then $3 < L < 7$? Prove it or give a counterexample.

c. Is it true that, if $a_n$ is monotonic, then it converges to a number $L$ such that $3 < L < 7$? Prove it or give a counterexample.
27. Suppose that the sequence $a_n$ converges, and that the sequence $b_n$ diverges. Prove that the sequence $a_n + b_n$ diverges.

28. Give an example of a sequence $a_n$ which diverges, and a sequence $b_n$ which diverges, such that the sequence $a_n + b_n$ converges.

29. Suppose that the sequence $a_n$ converges to a non-zero limit, and that the sequence $b_n$ diverges. Prove that the sequence $b_n/a_n$ diverges.

30. Perhaps the most famous sequence of all time is the Fibonacci sequence, $F_n$, $n \ge 0$, given by:

0, 1, 1, 2, 3, 5, 8, 13, 21, ...,


where the sequence is given iteratively by specifying that $F_0 = 0$, $F_1 = 1$, and, for all $n \ge 2$,

$$F_n = F_{n-1} + F_{n-2}.$$

That is, after the first two terms, each term is the sum of the two terms before it. In this exercise, we will look at a closed (non-iterative) formula for $F_n$.

a. Suppose that $a$ and $b$ are constants. Consider the sequence $c_n$, $n \ge 0$, given by

$$c_n = a\left(\frac{1+\sqrt{5}}{2}\right)^n + b\left(\frac{1-\sqrt{5}}{2}\right)^n.$$

Determine $a$ and $b$ so that $c_0 = 0$ and $c_1 = 1$.


b. Show that 


c. Using the constants $a$ and $b$ that you found in part (a), show that the sequence $c_n$ in part (a) is, in fact, the Fibonacci sequence $F_n$. Since you know that $c_0 = 0$ and $c_1 = 1$, to show that $c_n = F_n$, you need to show that $c_n$ satisfies the Fibonacci iteration formula, i.e., you need to show that, for $n \ge 2$,

$$c_n = c_{n-1} + c_{n-2}.$$

d. Calculate

$$\lim_{n\to\infty}\frac{F_{n+1}}{F_n}.$$

The value that you obtain for this limit is known as the Golden Ratio.

While they may appear to be simply interesting mathematical curiosities, the Fibonacci sequence and the Golden Ratio surprisingly appear frequently in nature, for example, in the arrangements of leaves on a plant and in the spiral shell of a nautilus.
5.2 Theorems on Series I: Basic Properties


In the last few sections of the previous chapter, we looked at infinite sums in the guise of power series. For a power series function $p(x) = \sum_{k=0}^{\infty} c_k(x-a)^k$, we are interested in convergence of the series for different $x$ values. But, for any fixed value of $x$, a power series simply becomes a summation $\sum b_k$ of a sequence of numbers $b_k$, and we can forget that the numbers that we're adding are of the form $b_k = c_k(x-a)^k$; that is, we can forget that the sequence comes from the terms of a power series. Such a summation of fixed numbers is simply called a series or an infinite series; to emphasize that the terms are not changing as a function of some variable $x$, the terminology series of constants is often used.


When we discussed series earlier, we put off any discussion of general convergence theorems, other than the Ratio Test, Theorem 4.5.13, though we did also look at the convergence of a few specific series; in these latter cases, we established convergence by showing that certain functions equal their Taylor series in Section 4.4.


In this section, we will state, but (usually) not prove, a large number of results on convergence and divergence of series of constants. Starting with an infinite sequence $b_k$, for $k \ge m$, we add more and more terms of the sequence together in the partial sums, and see if these partial sums approach anything; if they do, then that thing is, by definition, the sum of the series $\sum_{k=m}^{\infty} b_k$, and we say that the series converges to, or equals, this limit. This means that the sum of a series is, in fact, the limit of a sequence, but it's the limit of the sequence of partial sums, not the sequence of terms.


The theorems that we shall state are usually referred to as tests for convergence or divergence. 


Even though we gave the following definition before in Definition 4.5.3, it is appropriate to give it again here. After the definition, we also repeat some of the comments that we made earlier, because these comments are vitally important to understanding infinite series.


Definition 5.2.1. Given a sequence $b_k$, for $k \ge m$, we define, for each $n \ge m$, the partial sum to be $\sum_{k=m}^{n} b_k$, and the infinite sum or infinite series or, simply, series to be the infinite summation $\sum_{k=m}^{\infty} b_k$, which, technically, consists of the sequence $b_k$, together with the summation instruction/symbol, telling you to add the sequence. The terms of the sequence being added are also referred to as the terms of the series.

The sum of the series or value of the series $\sum_{k=m}^{\infty} b_k$ is the limit as $n \to \infty$ of the partial sums, and we write

$$\sum_{k=m}^{\infty} b_k = \lim_{n\to\infty}\sum_{k=m}^{n} b_k,$$

provided the limit exists, in which case we say that the partial sums converge, that the series converges, or simply that the infinite sum exists and is equal to the limit of the partial sums.


If the limit of the partial sums does not exist, then we say that the series diverges or, 
simply, that the infinite sum does not exist. 


Note that there are actually two sequences associated with an infinite sum/series: the sequence of things you're adding, the terms, and the sequence of partial sums $s_n = \sum_{k=m}^{n} b_k$. What we're interested in is what happens to the sequence of partial sums as $n \to \infty$.

As with approximating Taylor series by Taylor polynomials, the partial sums of any infinite series are frequently used as approximations of the infinite summation, assuming that the infinite sum exists, i.e., assuming that the series converges.


We are trying to define what an infinite sum should mean. Intuitively, it means the limit as you add more and more terms; this means that an infinite sum is the limit of the partial sums, NOT the limit of the terms. The terms are just the individual summands; they are not sums themselves.


Remark 5.2.2. You may wonder about the difference between a series and its value or sum. 
Technically, a series is actually the sequence, together with the summation sign(s), indicating 
that you want to add the given sequence, and the value or sum is what you get when you actually 
do the summation, i.e., the limit of the partial sums. 


What's the point? Well...even though $\sum_{k=1}^{\infty} \frac{1}{2^k} = 1$ (look back at Theorem 4.4.10 and use $a = 1/2$ and $x = 1/2$), you shouldn't talk about the series that consists of just the number 1. The series means the sum written out as a sum of the terms; its value is just a number (provided the series converges).


Fortunately, this distinction seems to cause no problem; when people discuss infinite series, there is always a clear sequence being summed.

One last point, which may be obvious. When we write something like $\sum_{k=1}^{\infty} \frac{1}{2^k} = 1$, there is no need to assert separately that the series $\sum_{k=1}^{\infty} \frac{1}{2^k}$ converges; we've already stated what it converges to.
Okay. So, given an infinite series of constants, how do you find the sum? You might say “that's easy: just take the limit of the partial sums”. The problem, and it's a big one, is that there are very few series, or types of series, for which we can write nice, manageable expressions for the partial sums. In fact, basically, there are only two kinds of series for which formulas for the partial sums are easy to obtain: geometric series and telescoping series.

Using different terminology, we derived the formula for the partial sums of geometric series way back in Corollary 2.1.11, and we looked at geometric series themselves in Theorem 4.4.10. However, we wish to restate these results now, using a different letter $r$ for the base.


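Theorem 5.2.3. Consider the geometric series $\sum_{k=0}^{\infty} ar^k$ and the sequence of partial sums $s_n = \sum_{k=0}^{n} ar^k$, for $n \ge 0$.

1. If $r \ne 1$, then

$$s_n = a\cdot\frac{1 - r^{n+1}}{1-r}.$$

2. If $|r| < 1$, then the series converges to $a/(1-r)$, that is,

$$\sum_{k=0}^{\infty} ar^k = \frac{a}{1-r}.$$

3. If $|r| \ge 1$ and $a \ne 0$, then the series diverges.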
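The closed form for the partial sums is easy to sanity-check by brute force. The following Python sketch, using exact rational arithmetic, compares the formula $a(1-r^{n+1})/(1-r)$ against direct term-by-term addition; the function names and the sample values $a = 100$, $r = 9/10$ are choices made here for illustration.

```python
from fractions import Fraction


def geometric_partial_sum(a, r, n):
    """Closed form s_n = a (1 - r^(n+1)) / (1 - r), valid only for r != 1."""
    if r == 1:
        raise ValueError("the closed form requires r != 1")
    return a * (1 - r ** (n + 1)) / (1 - r)


def direct_partial_sum(a, r, n):
    """Add the terms a r^k for k = 0, ..., n one at a time."""
    return sum(a * r**k for k in range(n + 1))


a, r = Fraction(100), Fraction(9, 10)
limit = a / (1 - r)  # value of the infinite series, since |r| < 1

for n in (0, 1, 10, 50):
    s_n = geometric_partial_sum(a, r, n)
    print(n, float(s_n), float(limit - s_n))
```

The last column is the gap between the partial sum and $a/(1-r)$; it equals $a\,r^{n+1}/(1-r)$ and shrinks toward 0 as $n$ grows.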


Example 5.2.4. A classic social problem arises when a number of people are at dinner, and there's one piece of bread left. No one wants to take the entire remaining piece, and so each person rips off part of the bread and takes that piece, leaving some behind.

Let's suppose that we start with one piece of bread, and that each person who takes some bread takes 1/3 of the remaining bread. What do we get if we keep track of the total bread taken (as fractions of the initial 1 piece) and the total bread remaining, assuming that this thirding-process goes on forever (ignoring that this is physically impossible and/or that we would at some point split atoms and blow up everything)?

The first person takes 1/3, leaving behind 2/3. The second person takes 1/3 of the remaining 2/3, so $(1/3)(2/3)$ of the original one piece, leaving behind $(2/3)(2/3) = (2/3)^2$. The third person takes $(1/3)(2/3)^2$, leaving behind $(2/3)(2/3)^2 = (2/3)^3$, and so on.
After $n$ iterations of bread-dividing, we find that the total bread taken is

$$\frac{1}{3} + \frac{1}{3}\left(\frac{2}{3}\right) + \frac{1}{3}\left(\frac{2}{3}\right)^2 + \cdots + \frac{1}{3}\left(\frac{2}{3}\right)^{n-1},$$

while the bread remaining is simply $\left(\frac{2}{3}\right)^n$.

Hopefully, you see that the sum for the amount of bread taken is a geometric sum. Using the notation from Theorem 5.2.3, we have $a = 1/3$, $r = 2/3$, and the summation above is $s_{n-1}$, which by Theorem 5.2.3 is given by

$$s_{n-1} = \frac{1}{3}\cdot\frac{1-\left(\frac{2}{3}\right)^n}{1-\frac{2}{3}} = 1 - \left(\frac{2}{3}\right)^n.$$

This, of course, is what we'd better get; the total bread taken should equal the initial 1 minus the bread remaining $(2/3)^n$.


Thus, not surprisingly, the infinite sum of all the bread taken (ignoring explosions) is 1, the entire piece of bread, and you can reach this conclusion by summing the series or by seeing that the limit of remaining bread $\lim_{n\to\infty}(2/3)^n$ is zero.
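The bread bookkeeping can also be simulated directly. The Python sketch below (with a hypothetical helper `bread_after`) tracks the bread taken and the bread remaining after each person, using exact fractions, so the identity "taken equals 1 minus remaining" can be checked without rounding error.

```python
from fractions import Fraction


def bread_after(n, share=Fraction(1, 3)):
    """Return (total taken, remaining) after n people each take `share` of what is left."""
    taken, remaining = Fraction(0), Fraction(1)
    for _ in range(n):
        piece = share * remaining  # this person's piece
        taken += piece
        remaining -= piece
    return taken, remaining


for n in (1, 2, 3, 10, 40):
    taken, remaining = bread_after(n)
    print(n, float(taken), float(remaining))
```

After $n$ people, the remaining amount is exactly $(2/3)^n$, so the total taken creeps up toward 1 without ever exceeding it.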


Example 5.2.5. Theorem 5.2.3 tells us instantly that $\sum_{k=0}^{\infty}(1.1)^k$ diverges, while $\sum_{k=0}^{\infty}100(0.9)^k$ converges to $100/(1 - 0.9) = 1000$.

What if these series hadn't been written with the general summation formulas? For instance, what if you were given

$$100 + 100(0.9) + 100(0.9)^2 + 100(0.9)^3 + \cdots?$$

Technically, the first four terms don't determine the series, but sometimes we rely on your being able to see “clear” patterns. You are supposed to see that you go from each term to the next by multiplying by the same thing each time (here, 0.9). Once you see that each term is obtained from the previous one by multiplying by the same thing each time, the series is a geometric series, and the thing that you're multiplying by each time is the $r$ in Theorem 5.2.3. The initial term (here, 100) is the $a$ in Theorem 5.2.3. Now that you've identified the series as being geometric, and have determined the $r$ and the $a$, you apply Theorem 5.2.3.

Suppose the indexing variable doesn't start at 0. What can we say about the convergence/divergence of

$$\sum_{k=2}^{\infty}(1.1)^k \quad\text{and}\quad \sum_{k=2}^{\infty}100(0.9)^k\,?$$


There are two options here. One is to reindex, and the other is to realize that leaving off the 
first two terms from our original series (the series that started at k = 0) changes each partial 
sum by the same fixed amount. Let’s look at both approaches. 


• Suppose we let $j = k - 2$, so that, when $k$ starts at 2, $j$ starts at 0. Then, $k = j + 2$, and our two sums become

$$\sum_{k=2}^{\infty}(1.1)^k = \sum_{j=0}^{\infty}(1.1)^{j+2} = \sum_{j=0}^{\infty}(1.1)^2(1.1)^j$$

and

$$\sum_{k=2}^{\infty}100(0.9)^k = \sum_{j=0}^{\infty}100(0.9)^{j+2} = \sum_{j=0}^{\infty}100(0.9)^2(0.9)^j.$$

Even though the indexing variable is now $j$, instead of $k$, that doesn't change anything. In fact, at this point, if you really wanted to, you could replace the $j$'s with $k$'s. Either way, you should see that both series are geometric, with $r$ still equaling 1.1 and 0.9, respectively. Hence, the first series diverges, while the second series converges to

$$\frac{100(0.9)^2}{1-0.9} = 810.$$


If you're worried that you would never think to switch indices by letting $j = k - 2$, don't let that bother you; if you just wrote out a few terms, you'd be able to rewrite the summation with a different index, or you'd recognize the series as a new geometric series, with the same $r$, but a different $a$. For instance, you'd get

$$\sum_{k=2}^{\infty}(1.1)^k = (1.1)^2 + (1.1)^3 + (1.1)^4 + (1.1)^5 + \cdots,$$

and should immediately see that you have a geometric series with $r = 1.1$, since that's what you multiply by to get from one term to the next, and the initial term is $a = (1.1)^2$. This would be enough data to answer any geometric series question, but, if you really want to write this geometric series using an index which starts at 0, you would get

$$\sum_{j=0}^{\infty}(1.1)^2(1.1)^j \quad\text{or}\quad \sum_{k=0}^{\infty}(1.1)^2(1.1)^k,$$

or you could use some other indexing variable.


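• Another method for dealing with

$$\sum_{k=2}^{\infty}(1.1)^k \quad\text{and}\quad \sum_{k=2}^{\infty}100(0.9)^k$$

is to realize that these are the same series as those starting at $k = 0$, except that the first two terms are missing from the sum, i.e.,

$$\sum_{k=2}^{\infty}(1.1)^k = \left(\sum_{k=0}^{\infty}(1.1)^k\right) - \left(\sum_{k=0}^{1}(1.1)^k\right)$$

and

$$\sum_{k=2}^{\infty}100(0.9)^k = \left(\sum_{k=0}^{\infty}100(0.9)^k\right) - \left(\sum_{k=0}^{1}100(0.9)^k\right).$$

In fact, these subtractions exist on the level of the partial sums, i.e., for $n \ge 2$,

$$\sum_{k=2}^{n}(1.1)^k = \left(\sum_{k=0}^{n}(1.1)^k\right) - \left(\sum_{k=0}^{1}(1.1)^k\right)$$

and

$$\sum_{k=2}^{n}100(0.9)^k = \left(\sum_{k=0}^{n}100(0.9)^k\right) - \left(\sum_{k=0}^{1}100(0.9)^k\right).$$

It follows that the divergence of $\sum_{k=0}^{\infty}(1.1)^k$ implies the divergence of $\sum_{k=2}^{\infty}(1.1)^k$, and the convergence of $\sum_{k=0}^{\infty}100(0.9)^k$ to $100/(1 - 0.9) = 1000$ implies that $\sum_{k=2}^{\infty}100(0.9)^k$ converges to

$$1000 - \left(\sum_{k=0}^{1}100(0.9)^k\right) = 1000 - (100 + 100(0.9)) = 810,$$

as we found before.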
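Both approaches can be checked numerically. The Python sketch below computes the value 810 three ways: by adding many terms starting at $k = 2$, by the reindexed geometric formula with first term $100(0.9)^2$, and by subtracting the two omitted terms from 1000. The helper name `tail_sum` and the cutoff of 500 terms are arbitrary choices made for illustration, and floating-point arithmetic means the three values agree only approximately.

```python
def tail_sum(a, r, start, n_terms):
    """Add n_terms terms a * r^k, beginning at k = start."""
    return sum(a * r**k for k in range(start, start + n_terms))


# Brute force: 500 terms of 100 (0.9)^k starting at k = 2.
approx = tail_sum(100, 0.9, 2, 500)

# Reindexing: a geometric series with first term 100 (0.9)^2 and ratio 0.9.
reindexed = (100 * 0.9**2) / (1 - 0.9)

# Subtraction: full sum from k = 0, minus the k = 0 and k = 1 terms.
subtracted = 100 / (1 - 0.9) - (100 + 100 * 0.9)

print(approx, reindexed, subtracted)
```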
What we observed in the example above about leaving off a finite number of terms in a series is worth stating as a proposition.

Proposition 5.2.6. Suppose that $b_k$, for $k \ge m$, is a sequence of real numbers and $n \ge m$. Then, $\sum_{k=m}^{\infty} b_k$ converges if and only if $\sum_{k=n+1}^{\infty} b_k$ converges, and when the series converge,

$$\sum_{k=m}^{\infty} b_k = \sum_{k=m}^{n} b_k + \sum_{k=n+1}^{\infty} b_k.$$

In words, omitting or including a finite number of terms in a series does not affect whether the series converges or diverges; in the case of convergence, the sum of the series is changed by the sum of terms which are omitted/included.


Remark 5.2.7. Because of Proposition 5.2.6, when we are discussing whether or not a series converges, but not precisely what it might converge to, we sometimes write merely “the series $\sum b_k$”, knowing that the sum goes out to $\infty$ and that, as far as convergence/divergence is concerned, it doesn't matter where we start the summation (though it is tacitly assumed that the series starts at some $m$ such that $b_k$ is defined for all $k \ge m$).


Example 5.2.8. Before we leave the topic of geometric series, we should mention their relationship with repeating decimal expansions of real numbers.

It is beyond the scope of the subject matter in this book to prove that every rational number, i.e., every quotient of two integers (where the denominator is not 0), has a terminating or repeating decimal expansion, but, given a repeating decimal, our knowledge of geometric series does tell us how to write the repeating decimal as a quotient of integers.

Consider the real number given by the infinite repeating decimal

$$123.45678678678678\ldots$$

We would like to know what this is as a fraction. Let's call the number $x$. Then, $x - 123.45 = 0.00678678678\ldots$, and so

$$100(x - 123.45) = 0.678678678\ldots$$

Therefore, if we knew $0.678678678\ldots$ as a fraction, we'd be able to write $x$ as a fraction.

However, $0.678678678\ldots$ means the sum of the geometric series

$$0.678678678\ldots = \frac{678}{1000} + \frac{678}{(1000)^2} + \frac{678}{(1000)^3} + \frac{678}{(1000)^4} + \cdots,$$

and we know how to sum this. It's a geometric series, with $a = 678/1000$ and $r = 1/1000$. Thus,

$$0.678678678\ldots = \frac{a}{1-r} = \frac{\frac{678}{1000}}{1-\frac{1}{1000}} = \frac{678}{999}.$$

This same argument, applied to the general case of a repeating block, which starts in the 1/10's place, shows that such a repeating decimal is equal to the fraction with the repeating block in the numerator and a denominator with a matching number of 9's as digits.

As for our original $x$, we see that it's given by

$$x = \frac{12345}{100} + \frac{1}{100}\cdot\frac{678}{999},$$

where we'll let you finish by finding a common denominator and adding the numerators.
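The "repeating block over a matching number of 9's" rule is simple to mechanize. The following Python sketch, with a hypothetical helper `repeating_block_value`, builds the fraction for $0.678678678\ldots$ and then assembles $x$ exactly as in the last display, leaving the common-denominator arithmetic to the `fractions` module.

```python
from fractions import Fraction


def repeating_block_value(block: str) -> Fraction:
    """Value of 0.(block)(block)(block)... as an exact fraction.

    The numerator is the repeating block; the denominator has one 9 per digit.
    """
    return Fraction(int(block), 10 ** len(block) - 1)


# x = 12345/100 + (1/100) * (678/999)
x = Fraction(12345, 100) + Fraction(1, 100) * repeating_block_value("678")

print(x)         # exact value as a reduced fraction
print(float(x))  # decimal approximation, for comparison with 123.45678678...
```

The same helper applies to any repeating block that starts in the 1/10's place; a non-repeating prefix is handled, as in the example, by shifting with a power of 10.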


Of course, we should mention the result that perplexes so many people:

$$0.9999999\ldots = \frac{9}{10} + \frac{9}{10^2} + \frac{9}{10^3} + \cdots = 1.$$

This bothers people because the thinking is that the repeating 9's never “jump the decimal point”, and that 1 should be the “next” real number after $0.9999999\ldots$. We'll make two comments about this.

First, given two real numbers $a < b$, there are always an infinite number of real numbers in-between $a$ and $b$. For instance, $(a+b)/2$ would be halfway between $a$ and $b$; so, there is never a “next real number”. Second, infinite decimal expansions are defined to be the sum of the infinite series formed by placing each digit over the appropriate power of 10. This means that, by definition, an infinite decimal expansion is a limit; the limit of the partial sums. If you think of $0.9999999\ldots$ this way, there shouldn't be any confusion. What real number is being approached by the partial sums

0.9, 0.99, 0.999, 0.9999, 0.99999, 0.999999, ... ?

Uhhhhh...1, obviously.
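For readers who like to see the numbers, here is a small Python sketch that computes the partial sums 0.9, 0.99, 0.999, ... exactly and reports how far each one is from 1. The helper name `nines` is made up for this illustration.

```python
from fractions import Fraction


def nines(n):
    """Partial sum 9/10 + 9/10^2 + ... + 9/10^n, i.e., 0.99...9 with n nines."""
    return sum(Fraction(9, 10**k) for k in range(1, n + 1))


# Gap between 1 and each partial sum.
gaps = [1 - nines(n) for n in range(1, 8)]
print(gaps)
```

Each gap is exactly $1/10^n$, which can be made smaller than any positive number by taking $n$ large enough; that is precisely the statement that the partial sums have limit 1.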


We wrote earlier that, aside from geometric series, there's only one other type of series for which we can find nice formulas for the partial sums: telescoping series.

Recall, back in Definition 2.1.6, that, starting with a function $f$, the finite difference function $\Delta f$ is defined by $(\Delta f)(k) = f(k) - f(k-1)$, where, if $f(k)$ is defined for integers $k$ such that $m \le k \le n$, then $(\Delta f)(k)$ is defined for $m+1 \le k \le n$. For instance, $\Delta k^2 = k^2 - (k-1)^2 = 2k - 1$ (as we saw in Proposition 2.1.10).


The definition that we give below contains the ambiguous term “clear”; we will try to explain the point, and the problem with giving a precise definition, in the example that comes after the definition.

Definition 5.2.9. A series $\sum_{k=m}^{\infty} b_k$ is telescoping provided that there is a clear, elementary function $f$ such that, for all $k \ge m$, $b_k = (\Delta f)(k)$.


Our interest in telescoping series stems from Proposition 2.1.9, which, in our current context, tells us precisely:

Proposition 5.2.10. Suppose that we have a (telescoping) series $\sum_{k=m}^{\infty} b_k$ in which $b_k = (\Delta f)(k)$, for some $f$ which is defined for all integers greater than or equal to $m - 1$.

Then, for all $n \ge m$, the partial sum $s_n = \sum_{k=m}^{n} b_k$ satisfies the equality

$$s_n = f(n) - f(m-1).$$

Therefore, the series $\sum_{k=m}^{\infty} b_k$ converges if and only if the sequence $f(n)$ converges, i.e., if and only if $\lim_{n\to\infty} f(n)$ exists, and when the series converges, it converges to

$$\left(\lim_{n\to\infty} f(n)\right) - f(m-1).$$


N— Oo 


Recall that the proof for the above statement about the partial sums is not complicated; it’s 


just that things cancel in pairs in the summation: 


yb = Bn + dna Hv + Omer + Om = 


k=m 
(f(n)—f(n—-1)) + (f(n-1)— f(n—2)) + ++» (f(m+1)— f(m)) + (f(m)— f(m—-1)) = 


f(n) — f(m—1). 


It’s true that the rigorous proof requires mathematical induction, but the above observation is 
the heart of the matter. 
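The cancellation is easy to check numerically. The following Python sketch is only an illustration and not part of the original development: the helper names `finite_difference`, `partial_sum`, and `square` are made up here, and it uses $f(k) = k^2$, whose finite difference $2k-1$ was recalled above, so that the partial sums starting at $m = 1$ should equal $n^2 - 0^2$.

```python
def finite_difference(f, k):
    """Return (Delta f)(k) = f(k) - f(k - 1)."""
    return f(k) - f(k - 1)


def partial_sum(f, m, n):
    """Add the terms b_k = (Delta f)(k) for k = m, ..., n, one at a time."""
    return sum(finite_difference(f, k) for k in range(m, n + 1))


def square(k):
    # Illustrative choice of f; here (Delta f)(k) = 2k - 1.
    return k * k


m = 1
for n in (1, 5, 10, 100):
    # Proposition 5.2.10 predicts s_n = f(n) - f(m - 1).
    print(n, partial_sum(square, m, n), square(n) - square(m - 1))
```

The two printed columns agree for every $n$, even though the left one is computed by brute-force addition and the right one uses only two values of $f$.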


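Example 5.2.11. To produce a telescoping series, you can simply pick an $f$ and then take its finite difference. For instance, we'll take $f(k) = -1/k$. Then,

$$(\Delta f)(k) = -\frac{1}{k} - \left(-\frac{1}{k-1}\right) = \frac{1}{k-1} - \frac{1}{k},$$

and

$$\sum_{k=2}^{\infty} \left[\frac{1}{k-1} - \frac{1}{k}\right]$$

is a basic example of a telescoping series. We can now apply Proposition 5.2.10 to conclude that $\sum_{k=2}^{\infty} \left[\frac{1}{k-1} - \frac{1}{k}\right]$ converges, and

$$\sum_{k=2}^{\infty} \left[\frac{1}{k-1} - \frac{1}{k}\right] = \left(\lim_{n\to\infty} f(n)\right) - f(1) = \lim_{n\to\infty}\left(-\frac{1}{n}\right) - (-1) = 0 + 1 = 1.$$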
Of course, we could "simplify" the terms of this series by using that

$$\frac{1}{k-1} - \frac{1}{k} = \frac{k-(k-1)}{k(k-1)} = \frac{1}{k(k-1)}.$$

So, now we see that

$$\sum_{k=2}^{\infty} \frac{1}{k(k-1)} = 1.$$

The question is: do we want to say that the series $\sum_{k=2}^{\infty} \frac{1}{k(k-1)}$ telescopes? No, because the terms are not explicitly written as the difference function of some $f$. Of course, what's not clear to one person may be clear to another; still, not many people could look at $\frac{1}{k(k-1)}$ and think "ah, that's clearly $(\Delta f)(k)$, where $f(k) = -1/k$".

You should note how well-disguised the telescoping nature of the series is now; we've shown that

$$\sum_{k=2}^{\infty} \frac{1}{k(k-1)} = \frac{1}{2\cdot 1} + \frac{1}{3\cdot 2} + \frac{1}{4\cdot 3} + \frac{1}{5\cdot 4} + \cdots = 1,$$

and there is no cancellation of pairs staring us in the face that makes this summation obvious.


You might think that it shouldn't matter if the terms of the series are clearly a finite difference. In the definition of a telescoping series $\sum_{k=m}^{\infty} b_k$, why don't we just go ahead and write that there just has to exist an elementary function $f$, or maybe just an arbitrary function $f$, such that $b_k = (\Delta f)(k)$?

The problem is that, for every series $\sum_{k=m}^{\infty} b_k$, there always exists a function $f$ such that, for all $k \ge m$, $b_k = (\Delta f)(k)$, and, in general, it is difficult or impossible to know, without a good bit of work, whether such a function $f$ is elementary.

Consider any series $\sum_{k=m}^{\infty} b_k = \sum_{j=m}^{\infty} b_j$. Define $s(m-1) = 0$, and, for $k \ge m$, define the $k$-th partial sum $s(k) = \sum_{j=m}^{k} b_j$. Then, $(\Delta s)(m) = s(m) - 0 = b_m$, and, for $k > m$,

$$(\Delta s)(k) = s(k) - s(k-1) = \sum_{j=m}^{k} b_j - \sum_{j=m}^{k-1} b_j = b_k.$$

Understand the situation we're in. We want telescoping series so that we can obtain nice formulas for the partial sums via Proposition 5.2.10. What we've just seen is that the partial sums yield a function which tells us that any series telescopes, but, if we use the partial sum function for the telescoping, we arrive at the useless result that knowing the partial sums would enable us to determine the partial sums. Hence, we need to put some restrictions on the type, or clarity, of the function $f$ that we're allowed to choose such that $b_k = (\Delta f)(k)$.


The point is, in order for Proposition 5.2.10 to be useful, the function $f$ needs to be manageable, i.e., elementary, and it needs to be easily discernible.


One of the things that we just saw in the previous example leads us to an interesting result, 
a result that can tell us that there’s an easy way to see that some series diverge. 


Suppose that we have a series $\sum_{k=m}^{\infty} b_k$, and it converges to $L$. If we let $s_n$ be the partial sum $s_n = \sum_{k=m}^{n} b_k$, then, as we saw in the example above, $b_n = s_n - s_{n-1}$. However, as the series converges to $L$, the partial sums converge to $L$, and so $\lim_{n\to\infty} s_n = \lim_{n\to\infty} s_{n-1} = L$.

Therefore,

$$\lim_{n\to\infty} b_n = \lim_{n\to\infty} (s_n - s_{n-1}) = \lim_{n\to\infty} s_n - \lim_{n\to\infty} s_{n-1} = L - L = 0.$$


Thus, if a series converges, then the terms must approach 0. This is usually used in the logically 


equivalent, contrapositive, form, which we state first. 


Theorem 5.2.12. (Term Test for Divergence)

1. If the terms of a series do not approach 0, then the series diverges.

2. Equivalently, if a series converges, then the terms must approach 0.

3. If the terms of a series approach 0, the series may either converge or diverge, i.e., you can conclude nothing merely from the fact that the terms approach 0.


Item 2 of the Term Test tells us that any convergent series must have its terms approach 0, so every convergent series that we've discussed so far, like the alternating harmonic series or geometric series with $|r| < 1$, is an example of a convergent series for which the terms approach 0. However, we have mentioned before the harmonic series,

$$\sum_{k=1}^{\infty} \frac{1}{k} = 1 + \frac{1}{2} + \frac{1}{3} + \frac{1}{4} + \cdots,$$

which is a fundamental example of a series in which the terms approach 0, and yet the series still diverges. We will show that the harmonic series diverges in Proposition 5.2.16, after we give some easy examples of using the Term Test for Divergence. But what's going on? How can a series diverge if the terms approach 0?


The point of the harmonic series example is to show that it’s not enough for the terms 
of a series to approach 0 in order to conclude convergence of the series; the terms 


need to approach 0 fast enough to make up for the fact that we’re adding more and 


more of them together. 
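A quick numerical experiment makes "fast enough" more concrete. The Python sketch below is an illustration only: the helper name `partial_sums` is invented here, and the comparison series $\sum 1/k^2$ is chosen for contrast (its convergence is a standard fact, treated with the Integral Test in the next section). In both series the terms approach 0, but only one set of partial sums levels off.

```python
def partial_sums(term, checkpoints):
    """Return {n: term(1) + ... + term(n)} for each n in checkpoints."""
    results = {}
    total = 0.0
    k = 0
    for n in sorted(checkpoints):
        while k < n:
            k += 1
            total += term(k)
        results[n] = total
    return results


checkpoints = [10, 100, 1000, 10000, 100000]
harmonic = partial_sums(lambda k: 1.0 / k, checkpoints)
squares = partial_sums(lambda k: 1.0 / (k * k), checkpoints)

for n in checkpoints:
    # The term 1/n shrinks toward 0 in both cases.
    print(f"n={n:>6}  1/n={1.0 / n:.6f}  "
          f"sum 1/k={harmonic[n]:.4f}  sum 1/k^2={squares[n]:.6f}")
```

The harmonic partial sums keep creeping upward with every factor of 10, while the partial sums of $1/k^2$ stall below 1.645. Floating-point output like this suggests, but of course does not prove, the divergence that is established in Proposition 5.2.16.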


Example 5.2.13. Consider the series 


Se). 


k=1 


As $k \to \infty$, the terms approach $\frac{1}{4}$, not 0. Therefore, the series diverges by the Term Test for Divergence, Theorem 5.2.12.

Understand what's going on here: the sequence of terms converges, and what the terms converge to is 1/4, but the value of the series is what you get when you add together an infinite number of terms, so, basically, in the series, it's as though you're adding together an infinite number of 1/4's; not surprisingly, this adds up to $\infty$, and so the series diverges to $\infty$.


It is a fundamental error to confuse the limit of the terms with the sum of the series. The 
sum of the series is absolutely not the limit of the terms. The terms are what are being 


added in the series. The sum of the series is the limit of the partial sums, which is thus 
what is approached as you add more and more terms. 


Example 5.2.14. What does the Term Test for Divergence tell us about the convergence or divergence of

$$\sum_{k=1}^{\infty} \frac{7}{k}\,?$$

It is tempting to say "ah, now the terms approach 0, so the series must converge, by the Term Test for Divergence".

But remember Item 3 of the Term Test for Divergence; you can conclude nothing about the convergence or divergence of the series when the terms approach 0.

In fact, the series $\sum_{k=1}^{\infty} \frac{7}{k}$ is, term-wise, 7 times the harmonic series, and so, like the harmonic series, it diverges.


Example 5.2.15. What about the series $\sum_{k=0}^{\infty} (-1)^k$?

Here, the limit of the terms does not exist, since, as $k$ gets bigger, $(-1)^k$ continues to alternate between $-1$ and $1$. Does the Term Test for Divergence tell us anything about this series?

Yes. If the limit of the terms fails to exist, then it is true that "the terms of the series do not approach 0"; in fact, the terms don't approach anything, so, in particular, they don't approach 0. Hence, the series $\sum_{k=0}^{\infty} (-1)^k$ diverges by the Term Test for Divergence.


We have put it off long enough. We are going to show that the harmonic series diverges. In fact, this will follow as a special case of our p-Series Test, Corollary 5.3.15, but we give a more basic argument here because it is instructive.


Proposition 5.2.16. The harmonic series

$$\sum_{k=1}^{\infty} \frac{1}{k} = 1 + \frac{1}{2} + \frac{1}{3} + \frac{1}{4} + \cdots$$

diverges to $\infty$.

Proof. We want to show that, as $n \to \infty$, the sequence of partial sums

$$S_n = \sum_{k=1}^{n} \frac{1}{k}$$

goes to $\infty$.


Note that, as each partial sum is obtained from earlier partial sums by adding positive terms, the sequence of partial sums $S_n$ is increasing. Therefore, by Theorem 5.1.23, the sequence $S_n$ diverges if and only if the sequence is unbounded.

We shall find a lower bound on some of the partial sums, the partial sums of the form $S_{2^p}$, and show that this lower bound approaches $\infty$, which forces the harmonic series to diverge. Technically, we should use mathematical induction, but we'll make the idea clear.

The "trick" is to group together collections of terms in such a way that each group clearly adds up to something greater than or equal to 1/2. We will indicate the groups with parentheses:

$$1 + \left(\frac{1}{2}\right) + \left(\frac{1}{3} + \frac{1}{4}\right) + \left(\frac{1}{5} + \frac{1}{6} + \frac{1}{7} + \frac{1}{8}\right) + \left(\frac{1}{9} + \frac{1}{10} + \cdots + \frac{1}{16}\right) + \cdots.$$

You always end a group at 1 over a power of 2, and start the group at 1 over the next number after the last power of 2. Notice that, in each group, all of the summands are greater than or equal to the last one in that group. So, what we've written above is clearly greater than or equal to

$$1 + \left(\frac{1}{2}\right) + \left(\frac{1}{4} + \frac{1}{4}\right) + \left(\frac{1}{8} + \frac{1}{8} + \frac{1}{8} + \frac{1}{8}\right) + \left(\frac{1}{16} + \cdots + \frac{1}{16}\right) + \cdots = 1 + \frac{1}{2} + \frac{1}{2} + \frac{1}{2} + \frac{1}{2} + \cdots.$$

Thus, what we find is that, for each $p \ge 1$,

$$S_{2^p} \ge 1 + \frac{p}{2}.$$

As $p \to \infty$, $1 + \frac{p}{2} \to \infty$, and thus $S_{2^p} \to \infty$. Therefore, the increasing sequence of partial sums is unbounded and approaches $\infty$.


The harmonic series should be an example that you always keep in mind. It’s the most basic 
example of a series in which the terms approach 0, and yet the series nonetheless diverges. 


This shows that you cannot conclude convergence of a series simply because the terms 
approach 0. 
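The lower bound from the proof of Proposition 5.2.16, $S_{2^p} \ge 1 + p/2$, can be spot-checked with exact rational arithmetic. This is only a sanity check for small $p$, not a substitute for the induction argument, and the function name `harmonic_partial_sum` is an assumption introduced for the sketch.

```python
from fractions import Fraction


def harmonic_partial_sum(n):
    """Exact value of S_n = 1 + 1/2 + ... + 1/n as a Fraction."""
    return sum(Fraction(1, k) for k in range(1, n + 1))


for p in range(1, 11):
    s = harmonic_partial_sum(2 ** p)
    bound = 1 + Fraction(p, 2)
    # The grouping argument says S_{2^p} is at least 1 + p/2.
    assert s >= bound
    print(f"p={p:>2}  S_(2^p)={float(s):.4f}  1 + p/2={float(bound):.1f}")
```

Notice how slowly the partial sums grow: doubling the number of terms adds only about 0.69 to the sum, yet that is enough to pass any bound eventually.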
Multiplying a series by a constant, and adding two series, works as you'd expect. The proofs using limits of partial sums are straightforward; we leave them for you as an exercise.

Theorem 5.2.17. Suppose that we have infinite series $\sum_{k=m}^{\infty} a_k$ and $\sum_{k=m}^{\infty} b_k$.

1. Assume that $c \ne 0$. Then, the series $\sum_{k=m}^{\infty} a_k$ converges if and only if the series $\sum_{k=m}^{\infty} c\,a_k$ converges and, when the series converge,

$$\sum_{k=m}^{\infty} c\,a_k = c \cdot \sum_{k=m}^{\infty} a_k.$$

2. Suppose that $\sum_{k=m}^{\infty} a_k$ converges. Then, $\sum_{k=m}^{\infty} b_k$ converges if and only if $\sum_{k=m}^{\infty} (a_k + b_k)$ converges and, when the series converge,

$$\sum_{k=m}^{\infty} (a_k + b_k) = \sum_{k=m}^{\infty} a_k + \sum_{k=m}^{\infty} b_k.$$


By letting c = —1 in Item 1 above, and combining with Item 2, you can, naturally, obtain 


an analogous result about subtracting one series from another. 


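Example 5.2.18. By our theorem on geometric series, the series $\sum_{k=2}^{\infty} \left(\frac{1}{3}\right)^k$ converges and, by Remark 4.4.13 and Proposition 5.2.6, the series $\sum_{k=2}^{\infty} (-1)^{k-1}\left(\frac{1}{k}\right)$ also converges (it's the alternating harmonic series, except that the first term is missing). In fact, we know what these series converge to:

$$\sum_{k=2}^{\infty} \left(\frac{1}{3}\right)^k = \left[\sum_{k=0}^{\infty} \left(\frac{1}{3}\right)^k\right] - 1 - \frac{1}{3} = \frac{3}{2} - \frac{4}{3} = \frac{1}{6}$$

and

$$\sum_{k=2}^{\infty} (-1)^{k-1}\left(\frac{1}{k}\right) = \left[\sum_{k=1}^{\infty} (-1)^{k-1}\left(\frac{1}{k}\right)\right] - 1 = (\ln 2) - 1.$$

Thus, Theorem 5.2.17 tells us that

$$\sum_{k=2}^{\infty} \left[6\left(\frac{1}{3}\right)^k - (-1)^{k-1}\left(\frac{1}{k}\right)\right] = 6 \cdot \frac{1}{6} - \big((\ln 2) - 1\big) = 2 - \ln 2.$$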
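The value $2 - \ln 2$ can be compared against brute-force partial sums. The sketch below assumes that the two series in Example 5.2.18 are $\sum_{k=2}^{\infty} (1/3)^k$ and $\sum_{k=2}^{\infty} (-1)^{k-1}(1/k)$, combined as 6 times the first minus the second; the function name `partial_sum` is made up for the illustration.

```python
import math


def partial_sum(n):
    """Sum, for k = 2, ..., n, of 6*(1/3)**k - (-1)**(k-1) * (1/k)."""
    total = 0.0
    for k in range(2, n + 1):
        total += 6.0 * (1.0 / 3.0) ** k - (-1.0) ** (k - 1) / k
    return total


target = 2.0 - math.log(2.0)
for n in (10, 100, 1000, 20000):
    # The geometric part settles almost immediately; the alternating
    # part is what makes the convergence slow (error roughly 1/(2n)).
    print(n, partial_sum(n), target)
```

The slow approach to the target is typical of the alternating harmonic series and is unrelated to Theorem 5.2.17 itself.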


Example 5.2.19. You need to be careful not to try to apply Theorem 5.2.17 to sums of two divergent series. The term-by-term sum of two divergent series may diverge or converge.

For instance, the harmonic series $\sum_{k=1}^{\infty} \frac{1}{k}$ diverges. If you take the harmonic series, and add it to itself term-wise, you get

$$\sum_{k=1}^{\infty} \left(\frac{1}{k} + \frac{1}{k}\right) = \sum_{k=1}^{\infty} \frac{2}{k},$$

which diverges by Item 1 of Theorem 5.2.17, since it's 2 times the harmonic series (in the term-wise sense). What this shows is that the term-by-term sum of two divergent series may again be divergent.

On the other hand, using Item 1 of Theorem 5.2.17 with $c = -1$, we conclude that $\sum_{k=1}^{\infty} \left(-\frac{1}{k}\right)$ diverges. But the term-wise sum of $\sum_{k=1}^{\infty} \frac{1}{k}$ and $\sum_{k=1}^{\infty} \left(-\frac{1}{k}\right)$ is

$$\sum_{k=1}^{\infty} \left(\frac{1}{k} - \frac{1}{k}\right) = \sum_{k=1}^{\infty} 0 = 0,$$

which certainly converges.

However, you may correctly suspect that the above example of two divergent series being added, term-wise, and producing a convergent series can happen only because of cancellation. If all of the terms of both series had been $\ge 0$, or all of the terms of both series had been $\le 0$, then the term-by-term sum of two divergent series would again be divergent. We will see this in the next section.
In the example above, we saw that standard algebraic operations may not work so well when dealing with divergent series. Actually, things may be worse than you suspect.

Suppose that we have a series

$$\sum_{k=m}^{\infty} a_k = a_m + a_{m+1} + a_{m+2} + a_{m+3} + a_{m+4} + a_{m+5} + \cdots.$$


Since addition is associative, you might suspect that we can throw in parentheses anywhere that 
we want and not change the sum. However, if the series diverges, it is not true that grouping 


the terms necessarily leads to a series that continues to diverge. 


Example 5.2.20. Consider the series

$$1 - 1 + 1 - 1 + 1 - 1 + \cdots.$$

We write this summation without any parentheses, because we assume that it's obvious that we mean the series $\sum_{k=1}^{\infty} (-1)^{k-1}$. This series diverges by the Term Test for Divergence, Theorem 5.2.12; the terms do not approach 0. However, you're so used to the associativity of addition that you might believe that this series is the same as

$$(1 - 1) + (1 - 1) + (1 - 1) + \cdots$$

or

$$1 + (-1 + 1) + (-1 + 1) + (-1 + 1) + \cdots,$$

but these latter two series converge to 0 and 1, respectively, assuming that the groupings indicate the terms of the series. Thus, it is possible to start with a divergent series and, by grouping some terms together to form a new series, arrive at convergent series, and it's possible to get convergent series which converge to different numbers.
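The three behaviors in Example 5.2.20 show up immediately if you list partial sums. The Python sketch below is illustrative only; the variable names are assumptions, and it looks at just the first 12 terms of $1 - 1 + 1 - 1 + \cdots$.

```python
from itertools import accumulate

N = 12  # number of ungrouped terms to look at (an even number)

# Terms of 1 - 1 + 1 - 1 + ..., i.e., (-1)**(k-1) for k = 1, ..., N.
terms = [(-1) ** (k - 1) for k in range(1, N + 1)]
ungrouped = list(accumulate(terms))

# Grouping as (1 - 1) + (1 - 1) + ...: each new term is a pair sum.
pairs = [terms[i] + terms[i + 1] for i in range(0, N, 2)]
grouped_a = list(accumulate(pairs))

# Grouping as 1 + (-1 + 1) + (-1 + 1) + ...: keep the first term alone.
shifted = [terms[0]] + [terms[i] + terms[i + 1] for i in range(1, N - 1, 2)]
grouped_b = list(accumulate(shifted))

print("ungrouped partial sums:", ungrouped)
print("(1-1)+(1-1)+... sums:  ", grouped_a)
print("1+(-1+1)+... sums:     ", grouped_b)
```

The ungrouped partial sums bounce between 1 and 0 forever, while each grouping picks out a constant subsequence of them, which is exactly why the grouped series converge to different values.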


However, for convergent series, “associativity” works as you would expect. 
Theorem 5.2.21. Suppose that the terms of a series $\sum_{j=n}^{\infty} b_j$ are formed from a convergent series $\sum_{k=m}^{\infty} a_k$ by grouping disjoint finite sums of the $a_k$, without changing the order, i.e., suppose that the series $\sum_{j=n}^{\infty} b_j$ is formed by "adding some parentheses" to the convergent series $\sum_{k=m}^{\infty} a_k$.

Then, $\sum_{j=n}^{\infty} b_j$ converges, and $\sum_{j=n}^{\infty} b_j = \sum_{k=m}^{\infty} a_k$.

Proof. The partial sums of $\sum_{j=n}^{\infty} b_j$ are sums of the $a_k$'s, in order, and so form a subsequence of the sequence of partial sums of $\sum_{k=m}^{\infty} a_k$, which we are assuming is a convergent sequence. Theorem 5.1.27 immediately yields the theorem.


As we shall see in the next section, if all of the terms in a series are non-negative, it is not possible to group a divergent series in such a way that you get a convergent series; that is, the weird thing that happened in Example 5.2.20 can't happen if all of the terms in the series are non-negative.


5.2.1 Exercises 


In each of Exercises 1 through 10, list the first 5 terms of the series, and the first 
5 partial sums. Give the partial sums written out as summations, and (possibly 


using a calculator), also give each partial sum as a single number. 


i, 
I cM a Mas Be og eo Be 
1 10 13° 16° #19 | 
Uy 
1 if 1 1 1 
+ + 
10.


In each of Exercises 11 through 20, you are given $m$ and a formula for the partial sum $s_n = \sum_{j=m}^{n} b_j$ of an infinite series $\sum_{j=m}^{\infty} b_j$. (a) Give the terms $b_m, \ldots, b_{m+4}$, (b) give a formula for the general term $b_j$, and (c) determine whether the series $\sum_{j=m}^{\infty} b_j$ converges or diverges; if it converges, determine what it converges to.

11. $s_n = \frac{1}{n}$, $m = 1$

12. s,=1-—-, m=1 
n 
13. $s_n = (-1)^n$, $m = 0$

14. $s_n = 1$, $m = 0$


15. Py one et © 
er 


16. m= m=l1 

1 
17. — 7 vin Oa — 
7. 8 @— 2-1) m=3 


18. $s_n = \ln(5n^2) - \ln(n^2 + 1)$, $m = 1$


19. s, = , m=1 


20. s, = ——, m=1 
n 
In each of Exercises 21 through 30, you are given a series which is geometric, telescoping, or diverges by the Term Test for Divergence, Theorem 5.2.12. Determine if the series converges or diverges; if it converges, determine what it converges to.


mE) 
= 
» 50) 
“EC 

28. > [0+ 1)? 2" 
een 


30. S- tan ‘(q) 


In each of Exercises 31 through 40, determine if the given series converges or diverges. If it converges, determine what it converges to.

33.


34. 


35. 


36. 


37. 


38. 


39. 


40.

In each of Exercises 41 through 44, write the repeating decimal as a quotient of positive integers.

41. 0.2301230123012301...

42. 0.141572727272...

43. 2.71828182818281...

44. 123.123123123...

45. Suppose that $\sum_{k=1}^{\infty} a_k = 7$ and $\sum_{k=1}^{\infty} b_k = -2$. What can you conclude about the series

46. Suppose that $\sum_{k=1}^{\infty} a_k = 3$ and $\sum_{k=1}^{\infty} b_k = -3$. What can you conclude about the series

47. Suppose that $\sum_{k=1}^{\infty} a_k = 7$ and $\sum_{k=1}^{\infty} b_k$ diverges. What can you conclude about the series
48. Suppose that $\sum_{k=1}^{\infty} a_k$ diverges and $\sum_{k=1}^{\infty} b_k$ diverges. What can you conclude about the series $\sum_{k=1}^{\infty} (2a_k - 3b_k)$?
49. Consider the series

$$\sum_{k=0}^{\infty} a_k = \frac{3}{2} - 1 + \frac{5}{4} - 1 + \frac{9}{8} - 1 + \frac{17}{16} - \cdots,$$

in which, for all $n \ge 0$, $a_{2n} = (2^{n+1} + 1)/2^{n+1}$ and $a_{2n+1} = -1$.
a. Show that the series diverges. 
b. By grouping terms together, i.e., by inserting parentheses into the summation, show 
that you can obtain a convergent series. 
a. ae 
50. Consider the series S- (-—1)" + BI: 
k=1 
a. Show that the series diverges. 
b. By grouping terms together, i.e., by inserting parentheses into the summation, show 
that you can obtain a convergent series. 
51. Explain why the harmonic series is an important example, i.e., what it is an important 
example of. 
52. Prove that the "even harmonic series" $\sum_{k=1}^{\infty} \frac{1}{2k}$ diverges.
53. Prove that the "odd harmonic series" $\sum_{k=1}^{\infty} \frac{1}{2k-1}$ diverges.
54. Prove Theorem 5.2.17 on multiplying series by constants and adding two series.
55. A man is jogging home, accompanied by his dog. The man is a mile from home, and jogs 


at a constant speed of 5 miles per hour. His dog runs in front of him at a constant speed 
of 10 miles per hour and, when the dog gets home, he turns around and runs, at 10 mph, 
back to the man. When the dog reaches the man, the dog turns around and runs home 
again, at 10 mph. The dog continues running home and returning to the man, at 10 mph,
for the entire time it takes for the man to jog home. Assume that the time it takes the 
dog to turn around is negligible, so that the dog is really running at 10 mph for the whole 
trip. 


a. Write an infinite series for the total distance traveled by the dog. Your series should 


have as terms the distance traveled by the dog during each leg of his trip. 
b. Sum the series from part (a).


c. Use a simple distance/speed/time argument to obtain the same answer that you 


obtained in part (b). 
5.3 Theorems on Series II: 


Non-negative Series 


Series in which all of the terms are greater than or equal to zero, non-negative series, or in 
which all of the terms are less than or equal to zero, non-positive series, are easier to deal 
with than series that contain both positive and negative terms. However, all of the results on 
convergence/divergence for non-positive series can be obtained by negating, term-by-term, to 
obtain non-negative series, and then looking at the results for non-negative series. Consequently, 
it is customary to state the results, the convergence and divergence tests, for non-negative series 


only, leaving the results for non-positive series as quick corollaries. 


Non-negative series are more manageable than series with both positive and negative terms because, for a non-negative series, the sequence of partial sums is increasing, and Theorem 5.1.23 then tells us that the sequence of partial sums converges, i.e., the series converges, if and only if the sequence of partial sums is bounded above.

This will allow us to derive three useful tests for convergence of non-negative series: the Integral Test, the Comparison Test, and the Limit Comparison Test.


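Definition 5.3.1. A series $\sum_{k=m}^{\infty} b_k$ is non-negative provided that, for all $k \ge m$, $b_k \ge 0$.

Example 5.3.2. A geometric series $\sum_{k=0}^{\infty} ar^k$ is non-negative if and only if $a \ge 0$ and $r \ge 0$.

The harmonic series $\sum_{k=1}^{\infty} \frac{1}{k}$ is non-negative, while the alternating harmonic series $\sum_{k=1}^{\infty} (-1)^{k-1}\frac{1}{k}$ is not non-negative (nor is it non-positive).

The series

$$\sum_{k=1}^{\infty} \left(1 + \sin\left(\frac{k\pi}{2}\right)\right)\left(\frac{1}{k}\right)$$

is non-negative, even though there are an infinite number of terms which equal 0.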
From the definition of a non-negative series, the sequence of partial sums of a non-negative series is increasing and so, from Theorem 5.1.23, we immediately conclude:

Theorem 5.3.3. The partial sums of a non-negative series are an increasing sequence and, consequently, a non-negative series converges if and only if the partial sums are bounded above; if the partial sums are bounded above by $M$, then the sum of the convergent series is $\le M$. A non-negative series diverges if and only if it diverges to $\infty$.


Recall the definition of a subsequence in Definition 5.1.26. If we have a series $\sum_{n=m}^{\infty} a_n$ (non-negative or not), and form a new series by using as terms a subsequence of the $a_n$, we refer to this new series as a subseries.

Now, if we have a non-negative series $\sum_{n=m}^{\infty} a_n$, then the partial sums of any subseries are less than or equal to partial sums from $\sum_{n=m}^{\infty} a_n$ itself. Therefore, Theorem 5.3.3 immediately yields the following corollary.

Corollary 5.3.4. Suppose that $\sum_{n=m}^{\infty} a_n$ is a non-negative series which converges. Then, every subseries of $\sum_{n=m}^{\infty} a_n$ converges.

In other words, if a subseries of a non-negative series diverges (necessarily to $\infty$), then the original series also diverges to $\infty$.


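Example 5.3.5. Consider the last non-negative series from Example 5.3.2:

$$\sum_{k=1}^{\infty} \left(1 + \sin\left(\frac{k\pi}{2}\right)\right)\left(\frac{1}{k}\right) = 2 + \frac{1}{2} + 0 + \frac{1}{4} + \frac{2}{5} + \frac{1}{6} + 0 + \frac{1}{8} + \cdots.$$

If we consider the subseries using those terms where $k$ is even, we obtain

$$\frac{1}{2} + \frac{1}{4} + \frac{1}{6} + \frac{1}{8} + \cdots = \sum_{j=1}^{\infty} \frac{1}{2j},$$

which is, term-wise, 1/2 of the harmonic series. Thus, by Theorem 5.2.17, $\sum_{j=1}^{\infty} \frac{1}{2j}$ diverges and, by Corollary 5.3.4, so does the original series $\sum_{k=1}^{\infty} \left(1 + \sin\left(\frac{k\pi}{2}\right)\right)\left(\frac{1}{k}\right)$.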
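The claim that the even-indexed terms form half of the harmonic series can be verified exactly. The sketch below assumes the series in question has terms $\left(1 + \sin(k\pi/2)\right)(1/k)$; the names `SINE_QUARTER_TURNS`, `term`, and the two partial-sum helpers are invented for the illustration, and the sine values are tabulated by hand so that no rounding is involved.

```python
from fractions import Fraction

# Exact values of sin(k*pi/2) for k = 0, 1, 2, 3 (mod 4).
SINE_QUARTER_TURNS = (0, 1, 0, -1)


def term(k):
    """Exact k-th term (1 + sin(k*pi/2)) * (1/k) of the series."""
    return (1 + SINE_QUARTER_TURNS[k % 4]) * Fraction(1, k)


def even_subseries_partial_sum(n):
    """Sum of the terms with even index k = 2, 4, ..., 2n."""
    return sum(term(2 * j) for j in range(1, n + 1))


def harmonic_partial_sum(n):
    """Exact value of 1 + 1/2 + ... + 1/n."""
    return sum(Fraction(1, k) for k in range(1, n + 1))


print([str(term(k)) for k in range(1, 9)])
for n in (1, 10, 100):
    # Each even-indexed partial sum is exactly half a harmonic partial sum.
    assert even_subseries_partial_sum(n) == harmonic_partial_sum(n) / 2
```

Since the harmonic partial sums are unbounded, so are these, and Corollary 5.3.4 then passes the divergence up to the full series.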
Example 5.3.6. The conclusion of Corollary 5.3.4 is not true (in general) if we omit the condition that the series is non-negative. Consider the alternating harmonic series,

$$\sum_{k=1}^{\infty} (-1)^{k-1}\frac{1}{k} = 1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \cdots,$$

which, as we discussed in Remark 4.4.13, converges to $\ln 2$. However, if you just add up all of the negative terms, the terms with even denominators, you get, term-wise, $-1/2$ times the harmonic series, which diverges.

Thus, series that contain both positive and negative terms may converge, even though subseries may diverge.


Recall that Theorem 5.2.21 told us that grouping together sums in a convergent series has no effect on the convergence. In Example 5.2.20, we saw a divergent series that contains both positive and negative terms, for which it is possible to group the terms to form new series that converge. As another corollary to Theorem 5.3.3, we will now show that you cannot group the summations in a divergent non-negative series in order to produce a convergent series.

Corollary 5.3.7. Suppose that the terms of a series $\sum_{j=n}^{\infty} b_j$ are formed from a divergent non-negative series $\sum_{k=m}^{\infty} a_k$ by grouping disjoint finite sums of the $a_k$, without changing the order, i.e., by "adding some parentheses" to the divergent series $\sum_{k=m}^{\infty} a_k$.

Then, $\sum_{j=n}^{\infty} b_j$ also diverges.

Proof. Each partial sum of $\sum_{j=n}^{\infty} b_j$ is also a partial sum of $\sum_{k=m}^{\infty} a_k$, and the partial sums of $\sum_{k=m}^{\infty} a_k$ diverge to $\infty$. Therefore, the partial sums of $\sum_{j=n}^{\infty} b_j$ also diverge to $\infty$.


Now that we know, from Theorem 5.2.21 and Corollary 5.3.7, that grouping terms (i.e., inserting parentheses around finite numbers of terms) does not affect the convergence or divergence of a non-negative series, we would like to know that changing the order of the summation, i.e., rearranging the order in which the terms are added, also does not affect non-negative series.

For instance, we know that

$$\sum_{k=1}^{\infty} \frac{1}{k^2} = 1 + \frac{1}{2^2} + \frac{1}{3^2} + \frac{1}{4^2} + \cdots$$

converges and, in fact, we stated that it converges to $\pi^2/6$. Does it follow that the series formed by switching pairs of terms,

$$\frac{1}{2^2} + 1 + \frac{1}{4^2} + \frac{1}{3^2} + \frac{1}{6^2} + \frac{1}{5^2} + \cdots \qquad (5.2)$$

converges, and converges to $\pi^2/6$? You might think "sure, a sum doesn't care what order you add things in", and that's true for finite sums, but, as we shall see in the next section, it is false, in general, for series containing both positive and negative terms. However, non-negative series are well-behaved with respect to such rearrangements.


First, we need to give the technical definition of a rearrangement. Note that the only thing that we really need to rearrange is the order of the indices of the sequence of terms. For instance, if we let $a_k = 1/k^2$, then the series in Formula 5.2 is

$$a_2 + a_1 + a_4 + a_3 + a_6 + a_5 + \cdots.$$

Suppose that, for all integers $j \ge 1$, we define $r(j) = j + 1$ if $j$ is odd, and $r(j) = j - 1$ if $j$ is even. Then,

$$r(1) = 2, \quad r(2) = 1, \quad r(3) = 4, \quad r(4) = 3, \quad r(5) = 6, \quad r(6) = 5, \quad \ldots,$$

and the series in Formula 5.2 is simply $\sum_{k=1}^{\infty} a_{r(k)}$. All that we used here to reorder, or rearrange, our series was the one-to-one correspondence (the bijection) $r$ between the original indices and the newly ordered indices.
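To see the index-swapping map in action, here is a small Python sketch, given purely as an illustration. It assumes $a_k = 1/k^2$ and the map $r(j) = j+1$ for odd $j$, $r(j) = j-1$ for even $j$ described above; the function names are made up. It compares partial sums of the original and the pair-switched series using exact fractions.

```python
from fractions import Fraction


def r(j):
    """Swap each odd index with the even index that follows it."""
    return j + 1 if j % 2 == 1 else j - 1


def a(k):
    """Terms of the original series, a_k = 1/k^2."""
    return Fraction(1, k * k)


def original_partial_sum(n):
    return sum(a(k) for k in range(1, n + 1))


def rearranged_partial_sum(n):
    # Same terms, taken in the order a_{r(1)}, a_{r(2)}, ..., a_{r(n)}.
    return sum(a(r(k)) for k in range(1, n + 1))


print([r(j) for j in range(1, 9)])
for n in (2, 4, 10, 50):
    # After an even number of terms, both sums contain the same terms.
    print(n, float(original_partial_sum(n)), float(rearranged_partial_sum(n)))
```

For even $n$ the two partial sums coincide exactly, since the same $n$ terms have been added; for odd $n$ they differ by a single small term. This particular rearrangement is very mild, so it only hints at the general statement proved below.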


Definition 5.3.8. Consider a (not necessarily non-negative) series $\sum_{k=m}^{\infty} b_k$.

A permutation of the indices of the series is a one-to-one correspondence, a bijection, $r : \mathbb{Z}_{\ge m} \to \mathbb{Z}_{\ge m}$.

A rearrangement of the series $\sum_{k=m}^{\infty} b_k$ is a series $\sum_{k=m}^{\infty} b_{r(k)}$, where $r$ is a permutation of the indices.
Theorem 5.3.9. Consider a non-negative series $\sum_{k=m}^{\infty} b_k$.

1. If $\sum_{k=m}^{\infty} b_k$ converges to $L$, then every rearrangement of $\sum_{k=m}^{\infty} b_k$ converges to $L$.

2. If $\sum_{k=m}^{\infty} b_k$ diverges, then every rearrangement of $\sum_{k=m}^{\infty} b_k$ diverges.

Proof. First, note that Item 2 follows from Item 1, since the original series is a rearrangement of any rearrangement of itself; so, if a rearrangement converges, Item 1 implies that the original series converges.

Hence, we need to prove Item 1. Suppose that $\sum_{k=m}^{\infty} b_k$ converges to $L$ and that we have a rearrangement $\sum_{k=m}^{\infty} b_{r(k)}$. Let $S_n = \sum_{k=m}^{n} b_k$ denote the partial sums of $\sum_{k=m}^{\infty} b_k$, and let $R_n = \sum_{k=m}^{n} b_{r(k)}$ denote the partial sums of $\sum_{k=m}^{\infty} b_{r(k)}$.

For each $n$, all of the terms in $S_n$ eventually appear in some partial sum of $\sum_{k=m}^{\infty} b_{r(k)}$; let $R_{\alpha(n)}$ be such a partial sum. Note that $\alpha(n) \ge n$ (since the $R$ partial sum has to contain at least each term of the $S$ partial sum). Note also that, since $\sum_{k=m}^{\infty} b_k$ is non-negative, any extra terms in the $R$ partial sum are non-negative and, hence, $S_n \le R_{\alpha(n)}$.

Similarly, for each $p$, all of the terms in $R_p$ eventually appear in some partial sum of $\sum_{k=m}^{\infty} b_k$; let $S_{\beta(p)}$ be such a partial sum. Then, $\beta(p) \ge p$, and $R_p \le S_{\beta(p)}$.

Combining our inequalities, and letting $p = \alpha(n)$, we conclude that, for all $n \ge m$, $n \le \alpha(n) \le \beta(\alpha(n))$ and

$$S_n \le R_{\alpha(n)} \le S_{\beta(\alpha(n))}.$$

As $n \to \infty$, both $\alpha(n)$ and $\beta(\alpha(n))$ approach $\infty$, and so, by the Pinching Theorem for Sequences, the sequence of partial sums $R_{\alpha(n)}$ converges to $L$. As the partial sums $R_n$ form an increasing sequence, this forces the entire sequence $R_n$ to converge to $L$, i.e., $\sum_{k=m}^{\infty} b_{r(k)}$ converges to $L$.


Remark 5.3.10. You should understand the point of Theorem 5.2.21, Corollary 5.3.7, and Theorem 5.3.9. Together, they tell us that, for non-negative series, the sum of a series, which is the limit of its partial sums, behaves like a true sum of the collection of terms: the order in which we take the terms and/or how we group them does not affect the sum.

We want to emphasize again that these things are not true, in general, for series with both positive and negative terms. Such a series can diverge, and yet, some groupings of the terms can lead to convergent series (Example 5.2.20). Moreover, rearrangements of general convergent series can lead to series that converge to different values or even diverge to $\infty$; see Theorem 5.4.20.
We would like a test for convergence of series that allows us to use our knowledge of integrals, since we have many techniques for integrating. Moreover, it's not too hard to believe that integrals could be useful when looking at series; infinite series involve sums and integrals are limits of Riemann sums. In fact, we have the very useful:

Theorem 5.3.11. (The Integral Test) Suppose that $m$ is an integer, and that the function $f$ is defined and Riemann integrable on every closed, bounded subinterval $[a,b]$ of $[m, \infty)$; for instance, $f$ could be continuous on $[m, \infty)$. Suppose further that, for $x \ge m$, $f(x) \ge 0$ and $f$ is decreasing.

Then, the series $\sum_{k=m}^{\infty} f(k)$ converges if and only if the improper integral $\int_m^{\infty} f(x)\,dx$ converges. In addition, in the case where the integral and the series converge,

$$\int_m^{\infty} f(x)\,dx \;\le\; \sum_{k=m}^{\infty} f(k) \;\le\; f(m) + \int_m^{\infty} f(x)\,dx.$$

Proof. This proof is very intuitive; we will show that inequalities exist between the partial sums of the series and integrals in which the upper limits of integration are finite values of $x$.

Suppose that $k$ is an integer $\ge m$. As $f$ is decreasing, for all $x$ in the interval $[k, k+1]$, $f(k+1) \le f(x) \le f(k)$ and so, by the monotonicity of integration, Theorem 2.3.20,

$$\int_k^{k+1} f(k+1)\,dx \;\le\; \int_k^{k+1} f(x)\,dx \;\le\; \int_k^{k+1} f(k)\,dx.$$

Thus,

$$f(k+1) \;\le\; \int_k^{k+1} f(x)\,dx \;\le\; f(k)$$

and, hence, for all $n \ge m$,

$$\sum_{k=m}^{n} f(k+1) \;\le\; \sum_{k=m}^{n} \int_k^{k+1} f(x)\,dx \;\le\; \sum_{k=m}^{n} f(k).$$
Reindexing on the left, and adding together the integrals, we find

$$\sum_{k=m+1}^{n+1} f(k) \;\le\; \int_m^{n+1} f(x)\,dx \;\le\; \sum_{k=m}^{n} f(k).$$

If the integral $\int_m^{\infty} f(x)\,dx$ converges, then the left-hand inequality tells us that the partial sums of the infinite series are bounded above and, therefore, the series converges by Theorem 5.3.3. If the infinite series converges, then the right-hand inequality above tells us that the integrals $\int_m^{n+1} f(x)\,dx$ are bounded above. Then, by Theorem 2.5.14, the integral $\int_m^{\infty} f(x)\,dx$ converges.

In the case where the integral and series converge, by taking limits, we immediately obtain

$$\sum_{k=m+1}^{\infty} f(k) \;\le\; \int_m^{\infty} f(x)\,dx \;\le\; \sum_{k=m}^{\infty} f(k).$$

The right-hand inequality above yields the left-hand inequality in the statement of the theorem. The right-hand inequality in the statement of the theorem is obtained by adding $f(m)$ to each side of the left-hand inequality above.
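The finite inequality at the heart of this proof, $\sum_{k=m+1}^{n+1} f(k) \le \int_m^{n+1} f(x)\,dx \le \sum_{k=m}^{n} f(k)$, can be checked exactly for a specific decreasing function. The sketch below is only an illustration under stated assumptions: it picks $f(x) = 1/x^2$ (so that the integral from $a$ to $b$ is $1/a - 1/b$, which is hard-coded), and the helper names `integral` and `check` are invented.

```python
from fractions import Fraction


def f(x):
    """Illustrative decreasing, non-negative function f(x) = 1/x^2."""
    return Fraction(1, x * x)


def integral(a, b):
    """Exact integral of 1/x^2 from a to b, namely 1/a - 1/b."""
    return Fraction(1, a) - Fraction(1, b)


def check(m, n):
    """Test the two-sided bound from the proof for the given m <= n."""
    left = sum(f(k) for k in range(m + 1, n + 2))   # k = m+1, ..., n+1
    middle = integral(m, n + 1)
    right = sum(f(k) for k in range(m, n + 1))      # k = m, ..., n
    return left <= middle <= right


print(all(check(1, n) for n in range(1, 200)))
print(all(check(5, n) for n in range(5, 200)))
```

Geometrically, the left sum is the area of rectangles sitting under the graph of $f$ and the right sum is the area of rectangles covering it, which is why the integral is trapped in between.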


Example 5.3.12. Let's look at $f(x) = 1/x$, for $x \ge 1$. This function is certainly positive and decreasing. Therefore, the Integral Test tells us that $\sum_{k=1}^{\infty} \frac{1}{k}$ converges if and only if $\int_1^{\infty} \frac{1}{x}\,dx$ converges. Of course, the series here is the harmonic series, which we proved diverges in Proposition 5.2.16. So we can conclude that the integral diverges. However, we can see easily that the integral diverges,

$$\int_1^{\infty} \frac{1}{x}\,dx = \lim_{b\to\infty} \int_1^{b} \frac{1}{x}\,dx = \lim_{b\to\infty} (\ln b - \ln 1) = \infty,$$

and so the Integral Test gives us another proof that the harmonic series diverges.
Example 5.3.13. What does the Integral Test tell us about $\sum_{k=1}^{\infty} \frac{1}{k^2}$?

We look at the function $f(x) = 1/x^2$; this function is non-negative and decreasing for $x \ge 1$, and

$$\int_1^{\infty} \frac{1}{x^2}\,dx = \lim_{b\to\infty} \left(-\frac{1}{b} + 1\right) = 1.$$

Hence, the Integral Test tells us that $\sum_{k=1}^{\infty} \frac{1}{k^2}$ converges, and

$$1 \;\le\; \sum_{k=1}^{\infty} \frac{1}{k^2} \;\le\; 1 + 1 = 2.$$

In fact, it can be shown, using Fourier series techniques (see [3]), that $\sum_{k=1}^{\infty} \frac{1}{k^2}$ converges to $\pi^2/6$, which is, in fact, between 1 and 2.
Example 5.3.14. Use the Integral Test to decide whether or not $\sum_{k=2}^{\infty} \frac{1}{k(\ln k)^2}$ converges.

Solution:

The function $f(x) = \frac{1}{x(\ln x)^2}$ is certainly non-negative and decreasing for $x \ge 2$. We find the indefinite integral by substitution of $u = \ln x$, so that $du = (1/x)\,dx$, and

$$\int \frac{1}{x(\ln x)^2}\,dx = \int \frac{1}{u^2}\,du = -\frac{1}{u} + C = -\frac{1}{\ln x} + C.$$

And now we calculate the definite integral:

$$\int_2^{\infty} \frac{1}{x(\ln x)^2}\,dx = \lim_{b\to\infty} \left(-\frac{1}{\ln b} + \frac{1}{\ln 2}\right) = \frac{1}{\ln 2}.$$

Therefore, $\sum_{k=2}^{\infty} \frac{1}{k(\ln k)^2}$ converges, and

$$\frac{1}{\ln 2} \;\le\; \sum_{k=2}^{\infty} \frac{1}{k(\ln k)^2} \;\le\; \frac{1}{2(\ln 2)^2} + \frac{1}{\ln 2}.$$

As we saw in Example 5.3.12 and Example 5.3.13, the Integral Test works very quickly to let us decide about the convergence or divergence of a series of the form

$$\sum_{k=1}^{\infty} \frac{1}{k^p},$$

where $p$ was, respectively, 1 and 2 in the two examples. More generally, such a series is called a p-series and, as a corollary to the Integral Test, we find that the harmonic series, the $p = 1$ case, is the dividing case between p-series converging and diverging.


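Corollary 5.3.15. (The p-Series Test) The p-series

$$\sum_{k=1}^{\infty} \frac{1}{k^p}$$

converges if $p > 1$ and diverges if $p \le 1$.

In addition, in the convergent case, where $p > 1$,

$$\frac{1}{p-1} \;\le\; \sum_{k=1}^{\infty} \frac{1}{k^p} \;\le\; 1 + \frac{1}{p-1}.$$

Proof. We leave this proof as an exercise.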


Even if the indexing does not begin at 1, we still call a series $\sum_{k=m}^\infty \frac{1}{k^p}$ a p-series and, since omitting a finite number of terms does not affect convergence/divergence, such a p-series still converges if and only if $p > 1$.


Example 5.3.16. The p-Series Test tells us immediately that $\sum_{k=1}^\infty \frac{1}{k^{0.999}}$ diverges, while $\sum_{k=1}^\infty \frac{1}{k^{1.001}}$ converges to some value $v$ such that $1000 < v < 1001$.
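The bounds $1/(p-1)$ and $p/(p-1)$ come from applying the Integral Test to $f(x) = 1/x^p$ on $[1, \infty)$: the integral equals $1/(p-1)$, and the upper bound adds the first term, 1. A small sketch, with the hypothetical helper name `p_series_bounds`, makes the two cases $p = 2$ and $p = 1.001$ concrete.

```python
def p_series_bounds(p):
    """Return (lower, upper) bounds for the sum of 1/k^p, k = 1, 2, 3, ...

    The bounds are 1/(p-1) and p/(p-1); they only make sense when p > 1.
    """
    if p <= 1:
        raise ValueError("the p-series diverges for p <= 1")
    return 1.0 / (p - 1.0), p / (p - 1.0)


print(p_series_bounds(2))       # bounds for the sum of 1/k^2
print(p_series_bounds(1.001))   # roughly 1000 and 1001, up to rounding
```

Note how a barely convergent exponent such as $p = 1.001$ forces a very large sum, even though every term is smaller than the corresponding term of the harmonic series.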


It is important not to confuse p-series with geometric series, even though they look very 
similar. 


$$\sum_{k=1}^\infty \frac{1}{3^k} \text{ is geometric, while } \sum_{k=1}^\infty \frac{1}{k^3} \text{ is a p-series.}$$

Geometric series have a fixed base and variable exponent, while p-series have a variable base and a fixed exponent.


Our next convergence test is easy to see. Suppose that we have two sequences of terms, where one is always less than or equal to the other, i.e., for $k \geq m$, we have $0 \leq a_k \leq b_k$. Then, all of the partial sums satisfy the analogous inequality

$$0 \leq \sum_{k=m}^n a_k \leq \sum_{k=m}^n b_k.$$

Recalling, from an earlier theorem, that the only way for a non-negative series to diverge is for it to diverge to $\infty$, we immediately conclude:
Theorem 5.3.17. (The Comparison Test) Suppose that, for all $k \geq m$, we have the inequalities $0 \leq a_k \leq b_k$.

1. If $\sum_{k=m}^\infty b_k$ converges, then so does $\sum_{k=m}^\infty a_k$, and

$$\sum_{k=m}^\infty a_k \leq \sum_{k=m}^\infty b_k.$$

2. If $\sum_{k=m}^\infty a_k$ diverges (necessarily to $\infty$), then so does $\sum_{k=m}^\infty b_k$.


Remark 5.3.18. The conclusions of the Comparison Test should not be hard to remember, provided that you keep in mind that the only way for a non-negative series to diverge is for it to diverge to $\infty$. If you remember this, the Comparison Test basically says that a series being less than or equal to something that’s $< \infty$ implies the series is $< \infty$, and a series being greater than or equal to something that’s $\infty$ implies that the series is $\infty$.

If you think about it, it’s also not hard to see what the Comparison Test does not say. If the bigger series diverges, it diverges to $\infty$, and all you know about the smaller series is that it adds up to something $\leq \infty$; but that means that the smaller series could converge to something finite or diverge to $\infty$, so you conclude nothing. If the smaller series converges, then the bigger series adds up to something larger, but that something could be finite or $\infty$, so once again you conclude nothing.

You need for the bigger non-negative series to converge, or the smaller non-negative series to diverge, in order to conclude anything from the Comparison Test.


Example 5.3.19. Consider the series $\sum_{k=1}^\infty \frac{1}{k^2+0.5}$ and $\sum_{k=1}^\infty \frac{1}{k-0.5}$.

As $\frac{1}{k^2+0.5} \leq \frac{1}{k^2}$, and we know from the p-Series Test that $\sum_{k=1}^\infty \frac{1}{k^2}$ converges, the Comparison Test tells us that $\sum_{k=1}^\infty \frac{1}{k^2+0.5}$ also converges.

As $\frac{1}{k-0.5} \geq \frac{1}{k}$ for $k \geq 1$, and we know that the harmonic series $\sum_{k=1}^\infty \frac{1}{k}$ diverges, the Comparison Test tells us that $\sum_{k=1}^\infty \frac{1}{k-0.5}$ also diverges.

Note that the facts that $\frac{1}{k^2} \leq \frac{1}{k^2-0.5}$ and that $\sum_{k=1}^\infty \frac{1}{k^2}$ converges do not allow us to conclude from the Comparison Test anything about the convergence or divergence of $\sum_{k=1}^\infty \frac{1}{k^2-0.5}$.

What we saw at the end of the example above, that comparing $\sum_{k=1}^\infty \frac{1}{k^2}$ and $\sum_{k=1}^\infty \frac{1}{k^2-0.5}$ tells us nothing about the convergence/divergence of $\sum_{k=1}^\infty \frac{1}{k^2-0.5}$, is somewhat frustrating. Yeah, it’s true that $\frac{1}{k^2} < \frac{1}{k^2-0.5}$, and yet you may have the feeling that, when $k$ is really big, the $-0.5$ just shouldn’t matter (it’s negligible) and the series $\sum_{k=1}^\infty \frac{1}{k^2-0.5}$ should do whatever the series $\sum_{k=1}^\infty \frac{1}{k^2}$ does.

The following theorem says that this line of thinking is valid.


Theorem 5.3.20. (The Limit Comparison Test) Suppose that, for $k \geq m$, $a_k \geq 0$ and $b_k \geq 0$. Suppose also that

$$\lim_{k\to\infty} \frac{a_k}{b_k} = L,$$

where $L$ may be infinity (so, the limit exists as an extended real number).

1. If $L \neq 0$ and $L \neq \infty$, then the two series $\sum_{k=m}^\infty a_k$ and $\sum_{k=m}^\infty b_k$ do the same thing, i.e., either both converge or both diverge.

2. If $L = 0$, and $\sum_{k=m}^\infty b_k$ converges, then $\sum_{k=m}^\infty a_k$ converges.

3. If $L = \infty$, and $\sum_{k=m}^\infty b_k$ diverges, then $\sum_{k=m}^\infty a_k$ diverges.


Proof. We give the proof in Case 1, and leave the other two cases as an exercise.

So, suppose that

$$\lim_{k\to\infty} \frac{a_k}{b_k} = L,$$

where $L > 0$ is a real number. Note, first, that this means that there has to exist $m_0$ such that, for all $k \geq m_0$, $b_k \neq 0$, for, otherwise, the fraction doesn’t even exist for arbitrarily large $k$.

Let $\epsilon = L/2 > 0$. Then, by definition of the limit of a sequence, there exists an integer $N \geq m_0$ such that, for all integers $n \geq N$,

$$\left|\frac{a_n}{b_n} - L\right| < \epsilon = \frac{L}{2}.$$

Thus, for $n \geq N$,

$$\frac{L}{2} < \frac{a_n}{b_n} < \frac{3L}{2},$$

and, hence,

$$\frac{L}{2}\, b_n < a_n < \frac{3L}{2}\, b_n, \qquad (5.3)$$

where we used that $b_n > 0$ for $n \geq N$.

Now, suppose that $\sum_{n=m}^\infty b_n$ converges. Then, by Proposition 5.2.6, $\sum_{n=N}^\infty b_n$ converges, and by Theorem 5.2.17, $\sum_{n=N}^\infty \left(\frac{3L}{2}\right) b_n$ converges. Hence, using the inequality on the right-hand side of Formula 5.3 and the Comparison Test, Theorem 5.3.17, we conclude that $\sum_{n=N}^\infty a_n$ converges. Applying Proposition 5.2.6 again allows us to conclude that $\sum_{n=m}^\infty a_n$ converges.

The other conclusion is entirely similar. Suppose that $\sum_{n=m}^\infty a_n$ converges. Then, by Proposition 5.2.6, $\sum_{n=N}^\infty a_n$ converges. Using the inequality on the left-hand side of Formula 5.3 and the Comparison Test, we conclude that $\sum_{n=N}^\infty \left(\frac{L}{2}\right) b_n$ converges. By Theorem 5.2.17, $\sum_{n=N}^\infty b_n$ converges. Applying Proposition 5.2.6 again allows us to conclude that $\sum_{n=m}^\infty b_n$ converges.


Example 5.3.21. So, suppose you want to decide whether or not

$$\sum_{k=1}^\infty \frac{1}{k^2 - 0.5}$$

converges. How do you think about such a problem?

You think the way we discussed before proving the Limit Comparison Test. You think: “when $k$ is big, the 0.5 (added or subtracted) is negligible, and this series should do whatever $\sum_{k=1}^\infty \frac{1}{k^2}$ does, which is converge, since it’s a p-series with $p = 2 > 1$.” Thus, just by looking at the series (and knowing the appropriate convergence tests!), you immediately conclude that $\sum_{k=1}^\infty \frac{1}{k^2-0.5}$ converges.

Of course, we still need to show that the 0.5 was indeed “negligible” by calculating the required limit in the Limit Comparison Test. This means that we let one of the series be the one we don’t know about, $\sum_{k=1}^\infty \frac{1}{k^2-0.5}$, and we let the other series (that we limit compare with) be the one that we know about, whose terms look like those of the original series without the “negligible” parts, i.e., we limit compare with $\sum_{k=1}^\infty \frac{1}{k^2}$.

We calculate

$$\lim_{k\to\infty} \frac{\;\dfrac{1}{k^2-0.5}\;}{\dfrac{1}{k^2}} = \lim_{k\to\infty} \frac{k^2}{k^2-0.5} = \lim_{k\to\infty} \frac{1}{1 - \dfrac{0.5}{k^2}} = 1,$$

and see that, yes, we may use the Limit Comparison Test between $\sum_{k=1}^\infty \frac{1}{k^2-0.5}$ and $\sum_{k=1}^\infty \frac{1}{k^2}$, and conclude that $\sum_{k=1}^\infty \frac{1}{k^2-0.5}$ converges because $\sum_{k=1}^\infty \frac{1}{k^2}$ does.
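To see the “negligible” 0.5 disappear numerically, we can tabulate the ratio of the terms $\frac{1}{k^2-0.5}$ and $\frac{1}{k^2}$ for growing $k$. This is a sketch for intuition only; the names `a`, `b`, and `ratios` are our own.

```python
def a(k):
    """Term of the series we do not know about."""
    return 1.0 / (k * k - 0.5)


def b(k):
    """Term of the p-series (p = 2) that we compare with."""
    return 1.0 / (k * k)


# The ratio a_k / b_k = k^2 / (k^2 - 0.5) decreases toward 1 as k grows.
ratios = [a(k) / b(k) for k in (1, 10, 100, 1000)]
print(ratios)
```

A table like this suggests what the limit is, but it does not replace the limit calculation: only the actual limit lets us invoke the Limit Comparison Test.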


Remark 5.3.22. It was not important in the example above, or in the Limit Comparison Test in general, that $\lim_{k\to\infty}(a_k/b_k)$ equals 1; if the limit had turned out to be 37, we could still have used the Limit Comparison Test in exactly the way that we did.

However, what you typically get for the limit in the Limit Comparison Test is 1.

Why? Because of the way you arrive at the series that you limit compare with.

You start with some non-negative series $\sum a_k$. You look at it and think “ah, when $k$ is big, these parts of the terms are insignificant/negligible and, if I ignore the negligible parts, I get a non-negative series $\sum b_k$, and I know what the series $\sum b_k$ does, so I’ll limit compare with that.”

But, if you’ve correctly decided that $a_k$ and $b_k$ look alike, except for parts that are negligible when $k$ is big, then that precisely means that $\lim_{k\to\infty}(a_k/b_k) = 1$.
As our last two results on the convergence/divergence of non-negative series, we will state and prove the Ratio Test and the Root Test for non-negative series. We stated the Ratio Test for arbitrary series back in Theorem 4.5.13, and we shall look at both of these tests again in the next section on series which contain both positive and negative terms. However, in a strong sense, the Ratio and Root Tests are really theorems about non-negative series; it’s just that these results can be extended to arbitrary series by taking absolute values of the terms (see Theorem 5.4.13 and Theorem 5.4.14).


Theorem 5.3.23. (The Ratio Test for Non-Negative Series) Consider the series $\sum_{k=m}^\infty b_k$, where $b_k \geq 0$.

1. If there exists $r < 1$ (and, necessarily, $\geq 0$) and an integer $M \geq m$ such that, for all $k \geq M$,

$$b_k \neq 0 \quad\text{and}\quad \frac{b_{k+1}}{b_k} \leq r,$$

then the given series converges.

2. If there exists an integer $M \geq m$ such that, for all $k \geq M$,

$$b_k \neq 0 \quad\text{and}\quad \frac{b_{k+1}}{b_k} \geq 1,$$

then the given series diverges.

In particular, suppose that $\lim_{k\to\infty} \frac{b_{k+1}}{b_k}$ exists, as an extended real number; call its value $L$. Then,

a. if $L < 1$, the given series converges;

b. if $L > 1$, including $L = \infty$, the given series diverges;

c. if $L = 1$, the given series may converge or diverge.


Proof. Let’s dispose of Case 2 first. Suppose that there exists an integer $M \geq m$ such that, for all $k \geq M$,

$$b_k \neq 0 \quad\text{and}\quad \frac{b_{k+1}}{b_k} \geq 1, \quad\text{i.e.,}\quad b_{k+1} \geq b_k.$$

Then, for all $k \geq M$, $b_k \geq b_M > 0$. Therefore, $\lim_{k\to\infty} b_k$ is not zero, and so the series $\sum b_k$ diverges by the Term Test for Divergence, Theorem 5.2.12.

Now, let’s look at Case 1. Suppose that there exists $r < 1$ and an integer $M \geq m$ such that, for all $k \geq M$,

$$b_k \neq 0 \quad\text{and}\quad \frac{b_{k+1}}{b_k} \leq r, \quad\text{i.e.,}\quad b_{k+1} \leq b_k r.$$

Then, for all $k \geq M$,

$$b_k \leq b_M r^{k-M}.$$

Thus, the series $\sum_{k=M}^\infty b_k$ is, term-by-term, less than or equal to $\sum_{k=M}^\infty b_M r^{k-M}$. As $0 \leq r < 1$, $\sum_{k=M}^\infty b_M r^{k-M}$ converges, as it’s a geometric series with $|r| < 1$. Hence, by the Comparison Test, $\sum_{k=M}^\infty b_k$ converges, and so $\sum_{k=m}^\infty b_k$ converges.


Do not confuse the limits in the Limit Comparison Test, Theorem 5.3.20, and in the Ratio Test, Theorem 5.3.23. In the Limit Comparison Test, you expect, and want, to get 1 for the limit of the ratio of the terms of the two series. In the Ratio Test, if you get a 1 for the limit, that’s bad, because it means that you can’t conclude anything from the test.


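Example 5.3.24. Consider the series

$$\sum_{k=1}^\infty \frac{k!}{5^k}, \qquad \sum_{k=1}^\infty \frac{k^{1000}}{1.01^k}, \qquad\text{and}\qquad \sum_{k=1}^\infty \frac{7^k(k+1)}{k^k}.$$

Let’s see what the Ratio Test tells us about the convergence or divergence of these series.

For the first series, we calculate the limit of ratios

$$\lim_{k\to\infty} \frac{(k+1)!/5^{k+1}}{k!/5^k} = \lim_{k\to\infty} \frac{k+1}{5} = \infty,$$

and so the Ratio Test tells us that $\sum_{k=1}^\infty \frac{k!}{5^k}$ diverges.

For the second series, we calculate the limit of ratios

$$\lim_{k\to\infty} \frac{(k+1)^{1000}/1.01^{k+1}}{k^{1000}/1.01^k} = \lim_{k\to\infty} \left(\frac{k+1}{k}\right)^{1000} \cdot \frac{1}{1.01} = \lim_{k\to\infty} \left(1 + \frac{1}{k}\right)^{1000} \cdot \frac{1}{1.01} = \frac{1}{1.01} < 1,$$

and so the Ratio Test tells us that $\sum_{k=1}^\infty \frac{k^{1000}}{1.01^k}$ converges.

Note that, even though the exponent on the base $k$ is large, 1000, and the base 1.01 is barely bigger than 1, the Ratio Test and the convergence of the series are telling us that, when $k$ is large, $1.01^k$ is significantly larger than $k^{1000}$.

Let’s look at the third and final series. We calculate the limit of ratios

$$\lim_{k\to\infty} \frac{7^{k+1}(k+2)/(k+1)^{k+1}}{7^k(k+1)/k^k} = \lim_{k\to\infty} 7 \cdot \frac{k+2}{k+1} \cdot \frac{k^k}{(k+1)^{k+1}} = \lim_{k\to\infty} 7 \cdot \frac{k+2}{k+1} \cdot \left(\frac{k}{k+1}\right)^k \cdot \frac{1}{k+1}$$

$$= \lim_{k\to\infty} 7 \cdot \frac{1 + \frac{2}{k}}{1 + \frac{1}{k}} \cdot \frac{1}{\left(1 + \frac{1}{k}\right)^k} \cdot \frac{1}{k+1} = 7 \cdot 1 \cdot \frac{1}{e} \cdot 0 = 0 < 1,$$

and so the series $\sum_{k=1}^\infty \frac{7^k(k+1)}{k^k}$ converges.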
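The three limits of ratios can be explored numerically, although the terms themselves overflow ordinary floating-point arithmetic almost immediately. The sketch below works with logarithms of the terms instead; the helper names and the sample index `K` are our own choices, and the printed values only suggest the limits rather than prove them.

```python
import math


def log_term_1(k):
    """log of k! / 5^k, using lgamma(k + 1) = log(k!)."""
    return math.lgamma(k + 1) - k * math.log(5)


def log_term_2(k):
    """log of k^1000 / 1.01^k."""
    return 1000 * math.log(k) - k * math.log(1.01)


def log_term_3(k):
    """log of 7^k (k + 1) / k^k."""
    return k * math.log(7) + math.log(k + 1) - k * math.log(k)


def ratio(log_term, k):
    """Ratio b_{k+1} / b_k, computed from the logs of the terms."""
    return math.exp(log_term(k + 1) - log_term(k))


K = 10 ** 9
r1 = ratio(log_term_1, K)   # about (K + 1) / 5: grows without bound
r2 = ratio(log_term_2, K)   # close to 1 / 1.01
r3 = ratio(log_term_3, K)   # close to 0
print(r1, r2, r3)
```

For the second series the ratio is still larger than 1 for small $k$ (the terms grow at first), which is why a large sample index is needed before the ratio settles near $1/1.01$.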


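Remark 5.3.25. You might think that the p-Series Test, Corollary 5.3.15, would follow easily from the Ratio Test; however, that is not the case. If $b_k = 1/k^p$, then

$$\lim_{k\to\infty} \frac{b_{k+1}}{b_k} = \lim_{k\to\infty} \frac{1/(k+1)^p}{1/k^p} = \lim_{k\to\infty} \left(\frac{k}{k+1}\right)^p = \lim_{k\to\infty} \frac{1}{\left(1 + \frac{1}{k}\right)^p} = 1,$$

so the Ratio Test is useless here.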
There is a test that looks very similar to the Ratio Test; it’s called the Root Test. Like the proof of the Ratio Test, the proof of the Root Test involves a comparison with a geometric series in which $0 \leq r < 1$. We leave this proof as an exercise.

Theorem 5.3.26. (The Root Test for Non-Negative Series) Consider the series $\sum_{k=m}^\infty b_k$, where $b_k \geq 0$.

1. If there exists $r < 1$ (and, necessarily, $\geq 0$) and an integer $M \geq m$ such that, for all $k \geq M$,

$$\sqrt[k]{b_k} \leq r,$$

then the given series converges.

2. If there exists an integer $M \geq m$ such that, for all $k \geq M$,

$$\sqrt[k]{b_k} \geq 1,$$

then the given series diverges.

In particular, suppose that $\lim_{k\to\infty} \sqrt[k]{b_k}$ exists, as an extended real number; call its value $L$. Then,

a. if $L < 1$, the given series converges;

b. if $L > 1$, including $L = \infty$, the given series diverges;

c. if $L = 1$, the given series may converge or diverge.


Example 5.3.27. The most obvious time to use the Root Test is when the terms are explicitly 


k-th powers. For instance, 


k=1 
certainly converges by the Root Test, since 
Remark 5.3.28. The Ratio Test and the Root Test look very similar, and which one is easier to use depends on the series, though roots are frequently harder to manage than ratios. For instance, if the terms of the series involve factorials, like $k!$, then the Root Test will involve $k$-th roots of factorials, which are certainly not pleasant to deal with (see, however, Corollary 5.3.30). On the other hand, there are times when the Ratio Test fails (i.e., doesn’t tell us about convergence or divergence), and yet the Root Test easily yields a conclusion.


Consider the series

$$\frac{1}{2} + \frac{1}{2} + \frac{1}{4} + \frac{1}{4} + \frac{1}{8} + \frac{1}{8} + \cdots,$$

that is, a geometric series, with $r = 1/2$, except that we have each term twice. It may seem obvious that, since $\sum_{k=1}^\infty 1/2^k$ converges to 1, the given series converges to twice that; in fact, this is true and follows from the term-wise addition theorem, Theorem 5.2.17.

However, the Ratio Test is useless here for concluding convergence, since every other ratio of successive terms is precisely 1.

On the other hand, the Root Test works fine for this series. The terms are $b_k = \dfrac{1}{2^{k/2}}$ if $k$ is even, and $b_k = \dfrac{1}{2^{(k+1)/2}}$ if $k$ is odd, so that

$$\sqrt[k]{b_k} = \begin{cases} \dfrac{1}{2^{1/2}}, & \text{if } k \text{ is even;} \\[2ex] \dfrac{1}{2^{(k+1)/(2k)}}, & \text{if } k \text{ is odd.} \end{cases}$$

Therefore, $\lim_{k\to\infty} \sqrt[k]{b_k} = 1/\sqrt{2} < 1$, and so the Root Test tells us that the series converges.
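The contrast between the two tests on the doubled geometric series $\frac12 + \frac12 + \frac14 + \frac14 + \cdots$ is easy to see numerically. In the sketch below (the function name `b` and the sample indices are our own choices), the ratios of successive terms alternate between 1 and $1/2$ and so have no limit, while the $k$-th roots settle near $1/\sqrt{2} \approx 0.7071$.

```python
def b(k):
    """k-th term (k >= 1) of 1/2 + 1/2 + 1/4 + 1/4 + 1/8 + 1/8 + ..."""
    return 0.5 ** ((k + 1) // 2)


# Ratios b_{k+1} / b_k for k = 1, ..., 8: they never settle down.
ratios = [b(k + 1) / b(k) for k in range(1, 9)]

# k-th roots for one large even and one large odd index.
root_even = b(1000) ** (1.0 / 1000)
root_odd = b(1001) ** (1.0 / 1001)

# Partial sum of the first 100 terms; the full series sums to 2.
partial = sum(b(k) for k in range(1, 101))
print(ratios, root_even, root_odd, partial)
```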


As a final advantage of the Root Test, notice that the Root Test has no problem if an infinite 


number of terms in a series are 0, unlike the Ratio Test. 


The following result tells us that the Root Test “works” any time that the Ratio Test “works”, 


but the converse is not true, as we showed in the remark above. 
Proposition 5.3.29. Suppose that $b_k > 0$ and $\lim_{k\to\infty}(b_{k+1}/b_k) = L$, where $L$ is an extended real number. Then,

$$\lim_{k\to\infty} \sqrt[k]{b_k} = L.$$

In particular, if the limit in the Ratio Test, Theorem 4.5.13, tells you whether a series converges or diverges, the limit in the Root Test, Theorem 5.3.26, tells you the same thing.

Proof. See Remark 3.36 and Theorem 3.37 of [3].


The following corollary can be helpful in cases where you decide to use the Root Test. 


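Corollary 5.3.30.

$$\lim_{k\to\infty} \sqrt[k]{k} = 1 \qquad\text{and}\qquad \lim_{k\to\infty} \frac{k}{\sqrt[k]{k!}} = e.$$

In particular, $\lim_{k\to\infty} \sqrt[k]{k!} = \infty$.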


Proof. For the first equality, you can take natural logarithms, using that the natural logarithm is continuous, and reduce the problem to showing that $\lim_{k\to\infty}(\ln k)/k = 0$, which follows easily by applying l’Hôpital’s Rule (see [3]) to $\lim_{x\to\infty}(\ln x)/x$. Alternatively, you may apply Proposition 5.3.29 to the sequence $b_k = k$.


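To show the second equality, consider the sequence $b_k = k^k/k!$. Then, we find

$$\lim_{k\to\infty} \frac{b_{k+1}}{b_k} = \lim_{k\to\infty} \frac{(k+1)^{k+1}/(k+1)!}{k^k/k!} = \lim_{k\to\infty} \frac{k!}{(k+1)!} \cdot \frac{(k+1)^{k+1}}{k^k} = \lim_{k\to\infty} \left(1 + \frac{1}{k}\right)^k = e.$$

Now, by the proposition, $\lim_{k\to\infty} \sqrt[k]{b_k} = e$, i.e.,

$$\lim_{k\to\infty} \sqrt[k]{\frac{k^k}{k!}} = \lim_{k\to\infty} \frac{k}{\sqrt[k]{k!}} = e.$$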
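Both limits used in this proof can be checked numerically. The sketch below is an illustration only, with helper names of our own choosing; it evaluates $\sqrt[k]{k}$ and $k/\sqrt[k]{k!}$ at a large index. Because $k!$ itself is far too large for floating-point arithmetic, the second function computes $\sqrt[k]{k!}$ as $\exp(\ln(k!)/k)$, using `math.lgamma(k + 1)` for $\ln(k!)$.

```python
import math


def kth_root_of_k(k):
    """Return k^(1/k)."""
    return k ** (1.0 / k)


def k_over_kth_root_of_factorial(k):
    """Return k / (k!)^(1/k), computed via log(k!) = lgamma(k + 1)."""
    kth_root_of_factorial = math.exp(math.lgamma(k + 1) / k)
    return k / kth_root_of_factorial


K = 10 ** 6
print(kth_root_of_k(K))                 # slightly above 1
print(k_over_kth_root_of_factorial(K))  # slightly below e
print(math.e)
```

Since $k/\sqrt[k]{k!}$ stays near $e$ while $k$ grows without bound, $\sqrt[k]{k!}$ must itself grow without bound, roughly like $k/e$.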
5.3.1 Exercises


In each of Exercises 1 through 10, use the Integral Test, Theorem 5.3.11, or the p-Series Test, Corollary 5.3.15, possibly combined with other results, to (a) decide whether the given series converges or diverges, and (b) for the convergent series, give upper and lower bounds for the sum.


ef 
10. —___ 
2 e+ 


In each of Exercises 11 through 20, you are given a series $\sum_{k=m}^\infty a_k$. Find a series $\sum_{k=m}^\infty b_k$, which is a p-series, constant multiple of a p-series, or a geometric series, to compare with $\sum_{k=m}^\infty a_k$ to determine its convergence or divergence via the Comparison Test, Theorem 5.3.17.
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11. 


12. 


13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


Co 


1+sink 
Se 
k=1 


8 


k= 


Yo 
2, 
240) 


In each of Exercises 21 through 30, you are given a series $\sum_{k=m}^\infty a_k$. Find a p-series, constant multiple of a p-series, or a geometric series $\sum_{k=m}^\infty b_k$ to limit compare with $\sum_{k=m}^\infty a_k$ to determine its convergence or divergence via the Limit Comparison Test, Theorem 5.3.20.
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4 
24. S° ae 

4 
8.) ODP ine 


5 
26. aoe 
yy fj lnj 


27. 
28. 


29. ye 
30. ye @ oie 


In each of Exercises 31 through 37, use the Ratio Test or Root Test to determine 


whether the given series converges or diverges. 


33. ‘ 
aor 
p=4 


ee 
34. aa 
- 4 
35. Dam 


236. pono 
In each of Exercises 38 through 43, combine any/all of your convergence and divergence tests and theorems to decide whether the given series converges or diverges.


39. 


AO. 


Al. 


42. 


43. 


4A, 


45. 


A6. 


AT. 


—<f i 1 
Gta 


m+m}/2 


“ k-2 
i. 
k=3 


) 


Prove the p-Series Test, Corollary 5.3.15, as a corollary of the Integral Test, Theorem 5.3.11.


Prove Cases 2 and 3 of the Limit Comparison Test, Theorem 5.3.20.


Prove the Root Test, Theorem 5.3.26.


Prove that limz_..0 


a | re 
5.4 Theorems on Series III: Series with Positive and Negative Terms


Non-negative series are relatively “easy” to deal with, for the partial sums always form an increasing sequence, which converges if and only if it’s bounded above. Series with both positive and negative terms are not so well-behaved.

We end up classifying convergent series into two kinds: series that converge absolutely and series that converge conditionally. A series $\sum b_k$ converges absolutely if the non-negative series $\sum |b_k|$ converges. A series converges conditionally if it converges, but does not converge absolutely.

As we shall see, series which converge absolutely have very nice properties. On the other hand, series which converge conditionally are very strange; they do not really correspond to what we’d like to think of as the sum of a collection of numbers. For instance, if a series converges conditionally, then, by rearranging the terms of the series, i.e., by changing the order of the summation, we can make the series converge to anything that we want, or diverge to $\pm\infty$.

While we may use any of our tests on non-negative series to investigate absolute convergence, we will have only one general test for concluding conditional convergence: the Alternating Series Test, Theorem 5.4.17.

Given an infinite series $\sum_{k=m}^\infty b_k$, we know that, by definition, the series converges if and only if the sequence of partial sums $s_n = \sum_{k=m}^n b_k$ converges. However, the completeness of the real numbers tells us that a sequence of real numbers (and now we’re thinking of the sequence of partial sums) converges if and only if it’s a Cauchy sequence.


Thus, we conclude 


Theorem 5.4.1. (Cauchy Criterion for Series Convergence) A series $\sum_{k=m}^\infty b_k$ converges if and only if, for all $\epsilon > 0$, there exists an integer $M \geq m$ such that, for all integers $p$ and $q$ such that $M \leq p \leq q$,

$$\left|\sum_{k=p}^q b_k\right| < \epsilon.$$

From the Cauchy Criterion, we quickly conclude an important corollary: 
Corollary 5.4.2. Consider the series $\sum_{k=m}^\infty |b_k|$ formed by taking the absolute value of each term of the series $\sum_{k=m}^\infty b_k$.

If $\sum_{k=m}^\infty |b_k|$ converges, then $\sum_{k=m}^\infty b_k$ converges and

$$-\sum_{k=m}^\infty |b_k| \leq \sum_{k=m}^\infty b_k \leq \sum_{k=m}^\infty |b_k|.$$


Proof. Suppose that $\sum_{k=m}^\infty |b_k|$ converges. We will use the Cauchy Criterion to conclude that $\sum_{k=m}^\infty b_k$ converges. The stated inequality then follows easily from the analogous inequalities on the partial sums.

Let $\epsilon > 0$. As $\sum_{k=m}^\infty |b_k|$ converges, the Cauchy Criterion tells us that there exists an integer $M \geq m$ such that, for all integers $p$ and $q$ such that $M \leq p \leq q$,

$$\sum_{k=p}^q |b_k| < \epsilon, \qquad (5.4)$$

where the outer absolute value signs are now unnecessary (since the sum is non-negative).

However, it is an easy exercise to show that

$$-\sum_{k=p}^q |b_k| \leq \sum_{k=p}^q b_k \leq \sum_{k=p}^q |b_k| \qquad (5.5)$$

(which, in words, says something obvious: the sum of some positive and negative terms is less than or equal to the sum in which you make everything positive, and greater than or equal to the sum in which you make everything negative).

Now, Formula 5.4 tells us that $\sum_{k=p}^q |b_k| < \epsilon$ and, negating, that $-\epsilon < -\sum_{k=p}^q |b_k|$. Combining this with Formula 5.5, we find that, for $M \leq p \leq q$,

$$-\epsilon < \sum_{k=p}^q b_k < \epsilon,$$

i.e., $\left|\sum_{k=p}^q b_k\right| < \epsilon$. Therefore, the Cauchy Criterion tells us that $\sum_{k=m}^\infty b_k$ converges.


Example 5.4.3. So, if we take the series $\sum_{k=1}^\infty \frac{1}{k^2}$ and negate every term whose denominator is a prime number squared, we obtain the series

$$\sum_{k=1}^\infty b_k = 1 - \frac{1}{2^2} - \frac{1}{3^2} + \frac{1}{4^2} - \frac{1}{5^2} + \frac{1}{6^2} - \frac{1}{7^2} + \cdots.$$

The series $\sum_{k=1}^\infty b_k$ might look impossible to deal with, but it’s easy for us now; if we take the series whose terms are the absolute values of the $b_k$’s, we get back $\sum_{k=1}^\infty \frac{1}{k^2}$, which converges (it’s a p-series with $p = 2 > 1$). Thus, Corollary 5.4.2 tells us that $\sum_{k=1}^\infty b_k$ converges.
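A short computation illustrates the inequality between a series and its series of absolute values for this example. The sketch below is our own illustration: the names `is_prime`, `b`, and the cutoff `N` are not from the text. It negates the terms $1/k^2$ with $k$ prime and compares the signed partial sum with the partial sum of absolute values.

```python
def is_prime(n):
    """Trial-division primality check, adequate for small n."""
    if n < 2:
        return False
    i = 2
    while i * i <= n:
        if n % i == 0:
            return False
        i += 1
    return True


def b(k):
    """Term 1/k^2, negated when k is prime."""
    sign = -1.0 if is_prime(k) else 1.0
    return sign / (k * k)


N = 5000
signed = sum(b(k) for k in range(1, N + 1))
absolute = sum(abs(b(k)) for k in range(1, N + 1))

# The signed partial sum is trapped between -absolute and +absolute.
print(signed, absolute)
```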


Corollary 5.4.2 tells us that the convergence of $\sum |b_k|$ implies the convergence of $\sum b_k$; we give this type of convergence of $\sum b_k$ a name.


Definition 5.4.4. If $\sum_{k=m}^\infty |b_k|$ converges, then we say that $\sum_{k=m}^\infty b_k$ converges absolutely.

If $\sum_{k=m}^\infty b_k$ converges, but does not converge absolutely (i.e., $\sum_{k=m}^\infty |b_k|$ diverges), then we say that $\sum_{k=m}^\infty b_k$ converges conditionally.


Remark 5.4.5. Thus, there are three things that a series can do: converge absolutely, converge conditionally, or diverge. For a non-negative series (or a non-positive series), there are only two possibilities: absolute convergence or divergence.

Note that, to show absolute convergence of $\sum_{k=m}^\infty b_k$, you have to show one thing: that $\sum_{k=m}^\infty |b_k|$ converges. However, to show conditional convergence, you have to show two things: that $\sum_{k=m}^\infty |b_k|$ diverges and, yet, $\sum_{k=m}^\infty b_k$ nonetheless converges.
Example 5.4.6. Using our new terminology, what we saw in Example 5.4.3 is that the series converges absolutely.

On the other hand, the alternating harmonic series

$$\sum_{k=1}^\infty \frac{(-1)^{k+1}}{k} = 1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \cdots$$

converges conditionally. Why? Because, when we take the series with absolute values around each term, we get the harmonic series $\sum_{k=1}^\infty \frac{1}{k}$, which we know diverges (it’s a p-series, with $p \leq 1$), and yet we know that the alternating harmonic series converges (to $\ln 2$) from Remark 4.4.13.


Example 5.4.7. What about the series 


- + + fo? (5.6) 


We claim that this series diverges. How do you see this? There are several ways, but we’ll look at a way that leads to a useful general theorem.

It is easy to spot a convergent subseries hiding inside the series in Formula 5.6; we find the convergent subseries


where we have filled in zeroes for the missing terms; this series converges, since it’s a negated 
p-series with p = 2 > 1 (the extra zeroes change nothing). Of course, the original series is 
formed from this “zeroed out” subseries by adding 


1 1 1 1 
is a ns = 0 --: 
(tere eee os 


which is the harmonic series (with extra zeroes), and so diverges. Now, it follows from Theorem 5.2.17 that the sum of these last two series, which equals the original series, diverges.
The argument that we gave in the example above is completely general and allows us to conclude:


Theorem 5.4.8. Suppose that a series $\sum a_k$ has a convergent subseries, where the subseries converges to $L$. Delete the terms of the convergent subseries from $\sum a_k$, forming a new subseries $\sum b_j$ from the remaining terms.

Then, $\sum b_j$ converges if and only if $\sum a_k$ converges and, when they both converge,

$$\sum a_k = L + \sum b_j.$$

In words, removing a convergent subseries from a series does not change whether or not the series converges or diverges, but, in the convergent case, it does change the sum of the series by exactly the sum of what’s removed.


Recall the definition of a permutation of indices and rearrangement of a series from Definition 5.3.8. The following theorem tells us that rearrangement has no effect on absolutely convergent series.


Theorem 5.4.9. Suppose that we have an infinite series $\sum_{k=m}^\infty a_k$.

1. If $\sum_{k=m}^\infty a_k$ converges absolutely, then every rearrangement of $\sum_{k=m}^\infty a_k$ converges absolutely to the same value as $\sum_{k=m}^\infty a_k$.

2. If $\sum_{k=m}^\infty a_k$ does not converge absolutely, then no rearrangement of $\sum_{k=m}^\infty a_k$ converges absolutely.


Proof. The “...to the same value” statement is very technical to prove; we refer you to Theorem 4.3.24 of [4], but we’ll prove the rest.

First, Item 2 follows from Item 1, for if some rearrangement of $\sum_{k=m}^\infty a_k$ were to converge absolutely, then, by Item 1, any rearrangement of that rearrangement would converge absolutely. However, $\sum_{k=m}^\infty a_k$ itself is a rearrangement of any rearrangement of $\sum_{k=m}^\infty a_k$, and so $\sum_{k=m}^\infty a_k$ would have to converge absolutely.

Let’s prove what’s left of Item 1. Assume that $\sum_{k=m}^\infty a_k$ converges absolutely, i.e., that $\sum_{k=m}^\infty |a_k|$ converges. Then, an earlier theorem tells us that, for every permutation $r$ of the indices, $\sum_{k=m}^\infty |a_{r(k)}|$ converges, i.e., $\sum_{k=m}^\infty a_{r(k)}$ converges absolutely.
Absolute convergence also behaves well with respect to multiplying by a constant or adding two series.


Theorem 5.4.10. Suppose that we have infinite series $\sum_{k=m}^\infty a_k$ and $\sum_{k=m}^\infty b_k$, which converge absolutely. Let $c$ be a constant.

Then, $\sum_{k=m}^\infty c\,a_k$ and $\sum_{k=m}^\infty (a_k + b_k)$ converge absolutely.


Proof. The result follows quickly from looking at the partial sums. We find

$$\sum_{k=m}^n |c\,a_k| = |c| \sum_{k=m}^n |a_k|$$

and, using the triangle inequality, $|a + b| \leq |a| + |b|$; hence,

$$\sum_{k=m}^n |a_k + b_k| \leq \sum_{k=m}^n \left(|a_k| + |b_k|\right) = \sum_{k=m}^n |a_k| + \sum_{k=m}^n |b_k|.$$

The conclusions follow quickly. We leave the remaining details as an exercise.


In the following theorem, we describe a product of two series, in a way that leads to the power series product that we looked at in Theorem 4.6.24; this product, $\sum c_n$ below, is known as the Cauchy product of the series.


Theorem 5.4.11. Suppose that we have two series $\sum_{k=0}^\infty a_k$ and $\sum_{k=0}^\infty b_k$, and that $\sum_{k=0}^\infty a_k$ converges absolutely.

Consider the Cauchy product series $\sum_{n=0}^\infty c_n$, where

$$c_n = \sum_{p=0}^n a_p b_{n-p} = a_0 b_n + a_1 b_{n-1} + a_2 b_{n-2} + \cdots + a_{n-1} b_1 + a_n b_0.$$

1. If $\sum_{k=0}^\infty b_k$ converges, then $\sum_{n=0}^\infty c_n$ converges.

2. If $\sum_{k=0}^\infty b_k$ converges absolutely, then $\sum_{n=0}^\infty c_n$ converges absolutely.
Proof. See Theorem 4.3.27 and Exercise 4.3.40 of [4].
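To make the definition of $c_n$ concrete, here is a small sketch that forms the Cauchy product of two finite lists of terms. The function name `cauchy_product` and the choice of example are ours. Taking both series to be the geometric series $\sum_{k=0}^\infty (1/2)^k$, which converges absolutely to 2, the formula gives $c_n = (n+1)/2^n$. That the product series sums to $2 \cdot 2 = 4$ is a standard fact about Cauchy products of absolutely convergent series; we use it here as an assumption for the check, since it is not stated in the theorem above.

```python
def cauchy_product(a, b):
    """Return c_0, ..., c_{N-1}, where c_n = sum over p of a[p] * b[n - p].

    Only the first N = min(len(a), len(b)) coefficients are computed,
    since those are the ones fully determined by the given terms.
    """
    n_terms = min(len(a), len(b))
    return [sum(a[p] * b[n - p] for p in range(n + 1)) for n in range(n_terms)]


# First 60 terms of the geometric series with ratio 1/2.
a = [0.5 ** k for k in range(60)]
c = cauchy_product(a, a)

print(c[:4])    # c_n = (n + 1) / 2^n
print(sum(c))   # partial sum of the product series, close to 4
```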


Example 5.4.12. Of course, it’s easy to find examples in which taking a Cauchy product of a convergent series with a divergent series yields a divergent series; you are asked to produce such an example in Exercise 30.

However, something that you may think would be true is not; in Exercise 31, you are asked to find two conditionally convergent series whose Cauchy product diverges.


The Ratio and Root Tests for Non-Negative Series, Theorem 5.3.23 and Theorem 5.3.26, immediately yield tests for absolute convergence, simply by inserting absolute value signs. Typically, you cannot conclude that a series $\sum b_k$ diverges by showing that $\sum |b_k|$ diverges; the series could still converge conditionally. However, recall that the divergence conclusions of the Ratio and Root Tests for Non-Negative Series are arrived at by showing that the terms do not approach 0, i.e., that the series diverges by the Term Test for Divergence, Theorem 5.2.13. But, if $|b_k|$ does not approach 0, then neither does $b_k$, and so, if $\sum |b_k|$ diverges by the Term Test, then so does $\sum b_k$.


Thus, we immediately conclude the general Ratio and Root Tests. 


Theorem 5.4.13. (The Ratio Test) Consider the series $\sum_{k=m}^\infty b_k$.

1. If there exists $r < 1$ (and, necessarily, $\geq 0$) and an integer $M \geq m$ such that, for all $k \geq M$,

$$b_k \neq 0 \quad\text{and}\quad \left|\frac{b_{k+1}}{b_k}\right| \leq r,$$

then the given series converges absolutely.

2. If there exists an integer $M \geq m$ such that, for all $k \geq M$,

$$b_k \neq 0 \quad\text{and}\quad \left|\frac{b_{k+1}}{b_k}\right| \geq 1,$$

then the given series diverges.

In particular, suppose that $\lim_{k\to\infty} \left|\frac{b_{k+1}}{b_k}\right|$ exists, as an extended real number; call its value $L$. Then,

a. if $L < 1$, the given series converges absolutely;

b. if $L > 1$, including $L = \infty$, the given series diverges;

c. if $L = 1$, the given series may converge or diverge.


Theorem 5.4.14. (The Root Test) Consider the series $\sum_{k=m}^\infty b_k$.

1. If there exists $r < 1$ (and, necessarily, $\geq 0$) and an integer $M \geq m$ such that, for all $k \geq M$,

$$\sqrt[k]{|b_k|} \leq r,$$

then the given series converges absolutely.

2. If there exists an integer $M \geq m$ such that, for all $k \geq M$,

$$\sqrt[k]{|b_k|} \geq 1,$$

then the given series diverges.

In particular, suppose that $\lim_{k\to\infty} \sqrt[k]{|b_k|}$ exists, as an extended real number; call its value $L$. Then,

a. if $L < 1$, the given series converges absolutely;

b. if $L > 1$, including $L = \infty$, the given series diverges;

c. if $L = 1$, the given series may converge or diverge.


Example 5.4.15. Consider the series

$$\sum \frac{(-2)^k}{k!}.$$

To use the Ratio Test, we calculate the limit

$$\lim_{k\to\infty} \left|\frac{(-2)^{k+1}/(k+1)!}{(-2)^k/k!}\right| = \lim_{k\to\infty} \frac{2^{k+1}}{2^k} \cdot \frac{k!}{(k+1)!} = \lim_{k\to\infty} \frac{2}{k+1} = 0 < 1.$$

Therefore, the Ratio Test tells us that the series converges absolutely.

We can show the same thing with the Root Test. We calculate

$$\lim_{k\to\infty} \sqrt[k]{\left|\frac{(-2)^k}{k!}\right|} = \lim_{k\to\infty} \frac{2}{\sqrt[k]{k!}} = 0,$$

where, in the last step, we used that $\lim_{k\to\infty} \sqrt[k]{k!} = \infty$, from Corollary 5.3.30. Thus, the Root Test also tells us that the series converges absolutely. Of course, after we proved absolute convergence by the Ratio Test, we could have simply used Proposition 5.3.29 to conclude that the Root Test would yield the same limit.


We know that the alternating harmonic series

$$\sum_{k=1}^\infty \frac{(-1)^{k+1}}{k} = 1 - \frac{1}{2} + \frac{1}{3} - \frac{1}{4} + \cdots$$

converges, because we showed in Theorem 4.4.12 that the Maclaurin series for $\ln(1+x)$ converges and is equal to $\ln(1+x)$ at $x = 1$, and this yields the convergence of the alternating harmonic series to $\ln 2$.

We have also discussed the fact that the alternating harmonic series converges conditionally, because it converges, but does not do so absolutely, since the harmonic series itself (what you get when you take the absolute values of the terms) diverges.

In fact, up to now, we have had no good tests for concluding the convergence of a series which does not converge absolutely. But we shall give one now. It is the Alternating Series Test, and it applies to series, like the alternating harmonic series, in which the signs of the terms alternate, i.e., go $+, -, +, -, +, -, \cdots$ or $-, +, -, +, -, +, \cdots$.
Definition 5.4.16. A series $\sum_{k=m}^\infty a_k$ is alternating if and only if either $a_k = (-1)^k |a_k|$ for all $k \geq m$, or $a_k = (-1)^{k+1} |a_k|$ for all $k \geq m$, i.e., if and only if the terms of the series alternate between being non-negative and being non-positive.


Our primary interest in alternating series stems from: 


Theorem 5.4.17. (Alternating Series Test) Suppose that a series )-7_,,, ax satisfies the 
following three conditions: 


1. 37. ax is an alternating series; 
2. the terms are decreasing in absolute value, i.e., for all k > m, \ax+i| < |axl; 


3. the terms approach zero, i.e., limp—+oo dy = 0 or, equivalently, limp —oo |ax| = 0. 


Then, the series \\7-_, an converges. 


n 


Furthermore, if we let 84. denote the sum of the series, and let 8) = dip, Gk, then Sn 


approximates s to within |an+1|; more precisely, 


|Sc0 —_ Sn| < |an41|. 


In words, this says that the partial sum approximates the infinite sum to within the absolute 
value of the next term that’s not in the partial sum. 
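
As a numerical illustration of the error estimate, here is a minimal Python sketch for the alternating harmonic series, whose sum is $\ln 2$. The function names are our own, chosen only for this illustration; the sketch compares the actual error $|s_\infty - s_n|$ with the bound $|a_{n+1}| = 1/(n+1)$.

```python
import math


def alternating_harmonic_partial_sum(n):
    """Return s_n = sum_{k=1}^{n} (-1)^(k-1) / k."""
    return sum((-1) ** (k - 1) / k for k in range(1, n + 1))


def check_error_bound(n):
    """Return (actual error, bound |a_{n+1}|) for the n-th partial sum."""
    s_inf = math.log(2)          # known sum of the alternating harmonic series
    s_n = alternating_harmonic_partial_sum(n)
    error = abs(s_inf - s_n)
    bound = 1 / (n + 1)          # absolute value of the first omitted term
    return error, bound


if __name__ == "__main__":
    for n in (1, 10, 100, 1000):
        error, bound = check_error_bound(n)
        print(n, error, bound, error <= bound)
```

In each case the printed error should be no larger than the printed bound, as the theorem predicts.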


Proof. This proof is not difficult. We give the main ideas and leave the details as an exercise 


for you. 


First note, since the terms are decreasing in absolute value, that if a term is zero, then all 
of the subsequent terms must be zero, which would make the infinite sum actually a finite sum; 
convergence would be immediate, and the inequality would not be difficult to establish. So, in 


what follows, we will assume that all of the terms are non-zero. 


We will assume, for convenience, that $m = 1$; this is clearly an unimportant assumption. We
will also assume that the first term, $a_1$, is positive. The case where $a_1$ is negative will follow by
negating the terms in the proof that we give for the case where $a_1$ is positive.

It is easy to show that the odd partial sums $s_{2n+1}$, for $n \ge 0$, form a decreasing sequence,
which is bounded below by $a_1 + a_2$; thus, by Theorem 5.1.23, the sequence $s_{2n+1}$ converges to
some limit $L_1$, and $a_1 + a_2 \le L_1$.




Similarly, it is easy to show that the even partial sums $s_{2n}$, for $n \ge 1$, form an increasing
sequence, which is bounded above by $a_1$; thus, by Theorem 5.1.23, the sequence $s_{2n}$ converges
to some limit $L_2$, and $L_2 \le a_1$.

We need to show that $L_1 = L_2$, which would show that all of the partial sums converge to
this common limit. But this is easy, because

\[ |L_1 - L_2| = \lim_{n\to\infty} |s_{2n+1} - s_{2n}| = \lim_{n\to\infty} |a_{2n+1}| = 0. \]


Therefore, we have shown that $\sum_{k=1}^{\infty} a_k$ converges to $L_1 = L_2$, which is $\le a_1$. In the
case where $a_1$ may be positive or negative, what this shows is that $\sum_{k=1}^{\infty} a_k$ converges and
$\left| \sum_{k=1}^{\infty} a_k \right| \le |a_1|$.


To show the inequality in the theorem, note that

\[ s_\infty - s_n = \sum_{k=n+1}^{\infty} a_k, \]

and this new series, which starts at $a_{n+1}$, is itself an alternating series which satisfies the
requirements of the Alternating Series Test. Therefore, from our inequality above, which uses the
first term of the alternating series, we immediately conclude that

\[ |s_\infty - s_n| = \left| \sum_{k=n+1}^{\infty} a_k \right| \le |a_{n+1}|. \]

Example 5.4.18. Consider the alternating p-series,

\[ \sum_{k=1}^{\infty} \frac{(-1)^{k-1}}{k^p} = 1 - \frac{1}{2^p} + \frac{1}{3^p} - \frac{1}{4^p} + \cdots \]


• If $p \le 0$, the Term Test for Divergence, Theorem 5.2.12, tells us that the series diverges.

• If $p > 0$, then the Alternating Series Test tells us that the series converges.

• The p-Series Test, Corollary 5.3.15, tells us that $\sum_{k=1}^{\infty} \frac{1}{k^p}$
converges if $p > 1$ and diverges if $p \le 1$.

Therefore, the alternating p-series, $\sum_{k=1}^{\infty} (-1)^{k-1} \frac{1}{k^p}$, converges absolutely if $p > 1$, converges
conditionally if $0 < p \le 1$, and diverges if $p \le 0$.


For $p > 0$, we can easily estimate the entire infinite summation by the partial sums. For
instance, the last part of Theorem 5.4.17 tells us that


le. 
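
To make the estimate concrete, the following Python sketch (an illustration under our own naming, not part of the original text) finds the smallest number of terms $n$ for which the error bound $|a_{n+1}| = 1/(n+1)^p$ of Theorem 5.4.17 is at most a chosen tolerance, and then computes that partial sum.

```python
import math


def terms_needed(p, tol):
    """Smallest n >= 1 with 1/(n+1)^p <= tol, so that |s_inf - s_n| <= tol."""
    if p <= 0:
        raise ValueError("the Alternating Series Test requires p > 0")
    n = 1
    while (n + 1) ** (-p) > tol:
        n += 1
    return n


def alternating_p_partial_sum(p, n):
    """Return s_n = sum_{k=1}^{n} (-1)^(k-1) / k^p."""
    return sum((-1) ** (k - 1) / k ** p for k in range(1, n + 1))


if __name__ == "__main__":
    for p in (0.5, 1, 2):
        n = terms_needed(p, 0.003)
        print(p, n, alternating_p_partial_sum(p, n))
```

Note how slowly the bound shrinks for small $p$: the smaller $p$ is, the more terms are needed for the same tolerance.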


Recall now our discussion of power series as functions from Section 4.5. In that section, we
gave you one convergence/divergence test: the Ratio Test. This enabled us to determine the
radius of convergence of power series, but could never tell us what happened at the endpoints
of the intervals of convergence.


At long last, we are in a position to consider examples in which we determine the entire 
intervals of convergence of some power series, including what happens at the endpoints. Recall 
that power series converge absolutely on the interior of the interval of convergence, but, at an 


endpoint, it is possible to have divergence, conditional convergence, or absolute convergence. 


Example 5.4.19. Determine the intervals of convergence of the series 
and determine whether any convergence at an endpoint of the intervals of convergence is absolute
or conditional.


Solution: 


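As in Section 4.5, we use the Ratio Test, Theorem 5.4.13, to quickly determine the radii of
convergence. Because of the absolute values, we can drop the minus signs and find

\[ \lim_{k\to\infty} \left| \frac{(x-2)^{k+1}}{3^{k+1}\sqrt{k+1}} \cdot \frac{3^k \sqrt{k}}{(x-2)^k} \right| = \lim_{k\to\infty} \frac{\sqrt{k}}{3\sqrt{k+1}}\, |x-2| = \frac{|x-2|}{3} \]

and

\[ \lim_{k\to\infty} \left| \frac{7^{k+1}(x+1)^{k+1}}{(k+1)^2} \cdot \frac{k^2}{7^k (x+1)^k} \right| = \lim_{k\to\infty} \frac{7k^2}{(k+1)^2}\, |x+1| = 7|x+1|. \]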


Therefore, $p(x)$ converges absolutely if $|x-2|/3 < 1$, and diverges if $|x-2|/3 > 1$, i.e.,
converges absolutely if $|x-2| < 3$, and diverges if $|x-2| > 3$. Similarly, $q(x)$ converges
absolutely if $7|x+1| < 1$, and diverges if $7|x+1| > 1$, i.e., converges absolutely if $|x+1| < 1/7$,
and diverges if $|x+1| > 1/7$.

Thus, the interiors of the intervals of convergence of $p(x)$ and $q(x)$ are, respectively, the open
intervals $(2-3, 2+3) = (-1, 5)$ and $(-1 - \frac{1}{7}, -1 + \frac{1}{7}) = (-8/7, -6/7)$. The question is: what
happens at the endpoints of these intervals?


The beginning of the answer is: you plug the endpoint x values into the power series, look 
at the resulting series of constants, and use tests other than the Ratio Test. 


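• So, let's consider the series

\[ p(x) = \sum_{k=1}^{\infty} \frac{(-1)^k (x-2)^k}{3^k \sqrt{k}} \]

when we plug in the endpoints of $(-1, 5)$, i.e., when $x = -1$ and when $x = 5$.
When $x = -1$, we find the series of constants

\[ \sum_{k=1}^{\infty} \frac{(-1)^k (-3)^k}{3^k \sqrt{k}} = \sum_{k=1}^{\infty} \frac{1}{\sqrt{k}} = \sum_{k=1}^{\infty} \frac{1}{k^{1/2}}. \]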



This is a p-series, with $p = 1/2$, and so it diverges by the p-Series Test, Corollary 5.3.15.

When $x = 5$, we find the series of constants

\[ \sum_{k=1}^{\infty} \frac{(-1)^k\, 3^k}{3^k \sqrt{k}} = \sum_{k=1}^{\infty} \frac{(-1)^k}{\sqrt{k}}. \]

This is an alternating series, which converges by Theorem 5.4.17; note that the series does not
converge absolutely, as taking the absolute values of the terms would yield the previous divergent
p-series with $p = 1/2$.


Hence, we find that the interval of convergence of $p(x)$ is the interval $(-1, 5]$, and the
convergence is conditional at $x = 5$.


e Now, let’s consider the series 


So) yuan 


k=1 


when we plug in the endpoints of (—8/7, —6/7), i.e., when « = —8/7 and when x = —6/7. 
You can quickly see that, when we plug in these endpoints, we get 07. @ and )P., oe) 
respectively. These both converge absolutely, by the p-series Test, Corollary |5.3.15} with p = 2. 


Hence, we find that the interval of convergence is the closed interval [—8/7, —6/7], and that 


q(x) converges absolutely everywhere in the interval. 


As the final result of this section, we wish to state, and give an idea of the proof of, a result
that tells us that the summation in a conditionally convergent series does not correspond to
what we normally think of as adding together a collection of numbers: the order in which we
add the terms in a conditionally convergent series can have a dramatic effect on the summation.
This is not the case for absolutely convergent series, as we saw in Theorem 5.4.9.
Theorem 5.4.20. Suppose that we have an infinite series $\sum_{k=m}^{\infty} a_k$ which converges
conditionally. Let $L$ be any extended real number.

Then, there exists a rearrangement $\sum_{k=m}^{\infty} a_{r(k)}$ of the series such that

\[ \sum_{k=m}^{\infty} a_{r(k)} = L. \]

In words, a conditionally convergent series can be rearranged to converge to anything
whatsoever, or diverge to $\pm\infty$.


Proof. The details of the proof can be found in Theorem 4.3.26 of [4], but we will try here to
indicate how such a thing is possible.


Let $P$ denote the sum of the positive terms of $\sum_{k=m}^{\infty} a_k$, taken in order, and let $N$ denote
the sum of the negative terms of $\sum_{k=m}^{\infty} a_k$, taken in order.

$P$ and $N$ are defined by non-negative and non-positive subseries, respectively, and we may
assume that we have inserted zeroes for each "missing" term of each subseries. This enables us
to write $P = \sum_{k=m}^{\infty} b_k$ and $N = \sum_{k=m}^{\infty} c_k$, where $b_k \ge 0$, $c_k \le 0$, and $a_k = b_k + c_k$.

We claim that $P = \infty$ and $N = -\infty$. Why is this true? As $\sum b_k$ and $\sum c_k$ are
non-negative and non-positive, respectively, the only choice other than equaling $\pm\infty$ is that the
series converge. But, if either one of these series converges, then, since $\sum_{k=m}^{\infty} a_k$ converges,
Theorem 5.4.8 tells us that the other series must also converge. However, as $\sum b_k$ and $\sum c_k$ are
non-negative and non-positive, the only way for them to converge is absolutely, which would
imply, by Theorem 5.4.10, that their sum $\sum a_k$ converges absolutely. This would contradict
that $\sum a_k$ converges conditionally.

Therefore, the sum of the positive terms of $\sum a_k$ is $\infty$, and the sum of the negative terms is
$-\infty$.


What does this have to do with anything? Well... intuitively, things are fairly easy now. If
you want to produce a rearrangement of $\sum a_k$ that diverges to $\infty$, then you simply put lots of
(sizable) positive terms first, then a negative term, then lots of positive terms, then a negative
term; the point being that by adding together lots of positive terms early, you can overwhelm
the sporadic negative terms. You have to have every negative term in the series somewhere,
but they can come after big chunks of positive terms that add up to arbitrarily large positive
numbers.


Similarly, to produce a rearrangement that diverges to $-\infty$, just take lots of (sizable) negative
terms early, then a positive term, and keep going.

Once you believe that there are rearrangements that yield $\infty$ and $-\infty$, it's easy to believe,
if you were very careful with picking your rearrangement, that you could make it add up to
anything between $-\infty$ and $\infty$, i.e., make the rearrangement add up to any real number
whatsoever.
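
The idea of "being very careful" can be illustrated numerically. The Python sketch below is our own illustration, not the proof cited above: it rearranges the alternating harmonic series greedily, adding unused positive terms while the running total is at or below a chosen target and unused negative terms while it is above. The function name and the stopping rule (a fixed number of terms) are assumptions made for the example.

```python
from itertools import count


def rearranged_partial_sums(target, num_terms):
    """Greedy rearrangement of 1 - 1/2 + 1/3 - 1/4 + ... aimed at `target`.

    Positive terms 1, 1/3, 1/5, ... and negative terms -1/2, -1/4, ...
    are each used in their original order, and each is used at most once.
    """
    positives = (1 / k for k in count(1, 2))    # 1, 1/3, 1/5, ...
    negatives = (-1 / k for k in count(2, 2))   # -1/2, -1/4, -1/6, ...
    total = 0.0
    sums = []
    for _ in range(num_terms):
        if total <= target:
            total += next(positives)   # too small: add the next positive term
        else:
            total += next(negatives)   # too large: add the next negative term
        sums.append(total)
    return sums


if __name__ == "__main__":
    for target in (1.5, -1.0, 0.0):
        print(target, rearranged_partial_sums(target, 100000)[-1])
```

Because the positive terms alone sum to $\infty$ and the negative terms alone sum to $-\infty$, the running total keeps crossing the target, and because the terms shrink to zero, the crossings get closer and closer to it.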


5.4.1 Exercises 


In each of Exercises 1 through 15, determine whether the given series converges
absolutely, converges conditionally, or diverges. We omit the starting and final
values for the index on the summation; assume the starting index is big enough so
that all of the terms are defined and the final value is $\infty$.
sink 
aD a © 
—2+sink 
2. — 
pee 


3. S —2 venk 


jd 


~J 
a ™~ 
a, 
rea 
wa 
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es 
LY] or 
bo 
+ 
w 
Es 

— 


oo 


ey 
2 3(k!) +4 e 
(IF 
2h, J/3(kl) +4 
—1)F 
ue Dd, an 


u.S° (3+ =) 
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cos(km/2) 
a ak kink — 


13. 3 wee 


vk +1 
14. $°(-1)*- ae 


iV +1 
15. So (-1)*- JET 


In each of Exercises 16 through 19, find the first 5 terms of the Cauchy product
of the two given series; see Theorem 5.4.11. Also prove that each Cauchy product
converges.


foe) co 1 
16. $°(-1)' 5 and 5 °(-1) 

= (k+1) oo j+1 
- 3 (k : i y-1) a 

k=0 j=0 J 

| = oe 
18. 50 7 and S°(-1) = 

k=0 j=0 


19. ya and ys = 


In each of Exercises 20 through 23, show that the alternating series converges, and 
give a partial sum which estimates the entire sum of the infinite series to within 
0.01. 


3 mail 
k= 


= 1 
21. S$ (-1)*-1— 
k= Vk 


= Lal 
22. S°(-1)' oe 
k= 


_ 1 
2 oa 
7S ko 
24. If $\sum a_n$ converges absolutely and $\sum b_n$ converges absolutely, what can you say about
$\sum (a_n + b_n)$?

25. If $\sum a_n$ converges absolutely and $\sum b_n$ converges conditionally, what can you say about
$\sum (a_n + b_n)$?

26. If $\sum a_n$ converges conditionally and $\sum b_n$ converges conditionally, what can you say about
$\sum (a_n + b_n)$?

27. If $\sum a_n$ converges absolutely and $\sum b_n$ diverges, what can you say about $\sum (a_n + b_n)$?

28. If $\sum a_n$ converges conditionally and $\sum b_n$ diverges, what can you say about $\sum (a_n + b_n)$?

29. Complete the proof of Theorem 5.4.10.

30. Give an example of an absolutely convergent series and a divergent series such that the
Cauchy product (see Theorem 5.4.11) of the two series diverges.

31. Find two conditionally convergent series whose Cauchy product diverges.

32. Fill in the details of the proof of the Alternating Series Test, Theorem 5.4.17.

33. Write a brief essay, in which you discuss the ways in which conditionally convergent series
do not behave like "normal" sums of numbers.

34. Suppose you have a power series $p(x) = \sum_{k=0}^{\infty} c_k (x-a)^k$, with radius of convergence $R$.
Prove that, if $p(x)$ converges absolutely at one end of the interval of convergence, then
it converges absolutely at the other end. Is the same statement true if "absolutely" is
replaced both times with "conditionally"?


In each of Exercises 35 through 42, you are given a power series from the Exercises 


in Section for which you may have already found the radius of convergence. 


Finish finding the precise interval of convergence by determining what happens at 


the endpoints of the interval, i.e., if the power series has radius of convergence R, 


where $0 < R < \infty$, and is centered at $a$, determine whether the series converges

absolutely, converges conditionally, or diverges at $x = a - R$ and $x = a + R$.


35. 
36. $p(x) = \sum_{k=1}^{\infty} \sqrt{k}\,(x+3)^k = (x+3) + \sqrt{2}\,(x+3)^2 + \sqrt{3}\,(x+3)^3 + \sqrt{4}\,(x+3)^4 + \cdots$
a7 
= k 1 x x? x x 
= je 
P(z) 2 Pa) ~ 27 Bea + Be2 7 BQ + Bp 
(a — 4)* 
38. P(x) ~ 2 Tk+1 
k=0 
(x —4)§ 
39. p(x) = 3 Tkt+lk 
= 4, (2 +7)" 
40. p(w) = S°(-1)**1 me 
= 41 Ria + 7)* 
41. p(x) = 5 (-1)**1 uk 
oo k 
10 'ele) = Sry 2 
p(x) dS ) Th 


Appendix A 


An Introduction to Vectors and 
Motion 


A brief introduction to vectors in 2 and 3 dimensions 


A vector is frequently described in physics and engineering classes as something which has 
a magnitude and a direction. Typical physical vector quantities in the plane are things such as 
“move north 3 miles”, “travel west at 5 meters per second”, and “a force of 10 pounds pushing 


south”. 


A vector in the real line, $\mathbb{R}$, the xy-plane, $\mathbb{R}^2$, or xyz-space, $\mathbb{R}^3$, is usually represented by
an arrow, which points in the direction of the vector and has length equal to the magnitude of
the vector (even if the vector itself does not have length units). However, the arrow representing a
given vector has no fixed starting and ending point; parallel arrows which have the same length
and which have their tails and heads at corresponding ends (i.e., point in the same direction)
represent the same vector.


A vector in the real line is simply a real number r. The magnitude of r is the absolute value 
|r|. There are two possible “directions” for a vector in the real line; if r > 0, then r has the 
positive direction and, if r < 0, then r has the negative direction. The zero vector r = 0 is said 
to have every direction (in this case, both directions); this is convenient in statements of some 
results, and makes sense, in that, moving 0 feet in the positive direction is the same as moving 


0 feet in the negative direction. 


We shall now discuss vectors in $\mathbb{R}^2$. The analogous statements and definitions for vectors in
$\mathbb{R}^3$ should be clear, and we will explicitly state many of them after our discussion about vectors
in the plane.


We denote the arrow from the point $(a,b)$ to the point $(c,d)$ by $\overrightarrow{(a,b)(c,d)}$. Two arrows
$\overrightarrow{(a_1,b_1)(c_1,d_1)}$ and $\overrightarrow{(a_2,b_2)(c_2,d_2)}$ have the same direction and magnitude, i.e., represent the
same vector, if and only if, when we "drag them to where they both start at the origin", we get
the same arrow. This means that $\overrightarrow{(a_1,b_1)(c_1,d_1)}$ and $\overrightarrow{(a_2,b_2)(c_2,d_2)}$ represent the same vector
if and only if

\[ \overrightarrow{(0,0)(c_1 - a_1,\, d_1 - b_1)} = \overrightarrow{(0,0)(c_2 - a_2,\, d_2 - b_2)}, \]

i.e., if and only if $c_1 - a_1 = c_2 - a_2$ and $d_1 - b_1 = d_2 - b_2$. This means that, if we set our default
starting point for arrows as the origin, we can specify vectors simply by specifying the ending
points of the arrows. Thus, given an ordered pair of real numbers, like $(2,-5)$, we may refer to
$(2,-5)$ as a point, and picture it as a point, or we may refer to it as a vector, and picture it as
the vector represented by an arrow from the origin to the point $(2,-5)$.

Figure A.1: Arrows which all represent the same vector.


Vectors are frequently denoted by including little arrows over variable names, e.g., the vector
$\vec{v} = (2,-5)$. Note that it is not common to put an arrow over $(2,-5)$ itself; we either explicitly
state that $(2,-5)$ is a vector, or let the context determine whether $(2,-5)$ is being used as a
point or a vector.

If we have a vector $\vec{v} = (a,b)$, we refer to $a$ as the x-component of $\vec{v}$ and to $b$ as the
y-component of $\vec{v}$.


The magnitude, $|(a,b)|$, of a vector $(a,b)$ is the Euclidean length of the arrow from the origin
to the point $(a,b)$, i.e., $|(a,b)| = \sqrt{a^2 + b^2}$. Note that the use of "absolute value" signs for the
magnitude does not lead to confusion, for, as we discussed above, a vector in the real line is
simply a single real number, and the analogous notion of magnitude would be that the magnitude
of a real number $a$ is $\sqrt{a^2}$, but this is the same as the absolute value of $a$.

There is only one vector (in the plane) whose magnitude is zero: the zero vector $\vec{0} = (0,0)$.


A unit vector means a vector whose magnitude is 1. 
We define two other operations on vectors, vector addition and scalar multiplication. The
sum of two vectors $(a,b)$ and $(c,d)$ is defined by adding the corresponding components:


(a,b) + (c,d) = (a+c,b+d). 


The fact that we are adding the ordered pairs means that we are using them as vectors, not 
points. The scalar multiplication r(a,b) of the vector (a,b) by a real number (a scalar) r is 
defined by 

r(a,b) = (ra,rb). 


If $r \neq 0$, then $\left(\frac{1}{r}\right)\vec{v}$ is frequently written as $\frac{\vec{v}}{r}$.


From the definition of scalar multiplication, it follows quickly that $|r(a,b)| = |r| \cdot |(a,b)|$.
Multiplying by a positive scalar leaves the direction of the vector unchanged, but "scales" the
magnitude because, if $r > 0$, then $|r(a,b)| = |r| \cdot |(a,b)| = r\,|(a,b)|$. Multiplying by a negative
scalar produces a vector which points in the opposite direction from the original vector, but the
magnitude is scaled by the absolute value of the scalar; thus, $-2\vec{v}$ has twice the magnitude of
$\vec{v}$, but points in the opposite direction.


The negation, $-\vec{v}$, of a vector $\vec{v}$ is equal to $-1\vec{v}$. Subtraction of vectors $\vec{v} = (a,b)$ and
$\vec{w} = (c,d)$ is defined component-wise, so that $\vec{v} - \vec{w} = (a-c,\, b-d)$. This is written in terms of
vector addition and scalar multiplication as

\[ \vec{v} - \vec{w} = \vec{v} + (-1)\vec{w}. \]

Note that the distance between two points $(a,b)$ and $(c,d)$ is simply the magnitude of the vector
$(a,b) - (c,d)$ (or $(c,d) - (a,b)$).

If $\vec{v} \neq \vec{0}$, then there is a unique unit vector in the direction of $\vec{v}$, namely, $\frac{\vec{v}}{|\vec{v}|}$. In fact,
since most directions in the plane, using your intuitive notion of "direction", do not have some
predetermined name, the direction of a non-zero vector $\vec{v}$ is defined to be the unit vector $\frac{\vec{v}}{|\vec{v}|}$.
As in the case of "vectors" in the real line, we say that the zero vector $\vec{0}$ has every direction.


The dot product of two vectors $\vec{v} = (a,b)$ and $\vec{w} = (c,d)$ is defined by

\[ (a,b) \cdot (c,d) = ac + bd, \]

i.e., you multiply corresponding components and add. Note that the dot product of two vectors
is not a vector; it is a real number (a scalar). The dot product has a number of nice properties:
for all vectors $\vec{u}$, $\vec{v}$, and $\vec{w}$, and scalars $a$ and $b$,

1. (commutativity) $\vec{v} \cdot \vec{w} = \vec{w} \cdot \vec{v}$;
2. (distributivity/linearity) $\vec{u} \cdot (a\vec{v} + b\vec{w}) = a(\vec{u} \cdot \vec{v}) + b(\vec{u} \cdot \vec{w})$; and
3. $\vec{v} \cdot \vec{v} = |\vec{v}|^2$.

However, the reason that the dot product is of great interest is that it has the following
geometric interpretation:

\[ \vec{v} \cdot \vec{w} = |\vec{v}|\, |\vec{w}| \cos\theta, \]

where $\theta$ is the angle between the vectors $\vec{v}$ and $\vec{w}$. In particular, this means that $\vec{v}$ and $\vec{w}$ are
perpendicular if and only if $\vec{v} \cdot \vec{w} = 0$; note that our convention that the zero vector has every
direction makes this statement true even if $\vec{v}$ or $\vec{w}$ is the zero vector.
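
The operations above translate directly into code. Here is a minimal Python sketch, with function names of our own choosing, that represents vectors as tuples and computes magnitudes, dot products, and the angle $\theta$ from the geometric interpretation. The angle function assumes both vectors are non-zero, since the formula would otherwise divide by zero.

```python
import math


def magnitude(v):
    """Euclidean length of the vector v, e.g. sqrt(a^2 + b^2) in the plane."""
    return math.sqrt(sum(a * a for a in v))


def dot(v, w):
    """Multiply corresponding components and add."""
    return sum(a * b for a, b in zip(v, w))


def angle_between(v, w):
    """Angle theta (in radians) with v . w = |v| |w| cos(theta).

    Assumes v and w are both non-zero.
    """
    cos_theta = dot(v, w) / (magnitude(v) * magnitude(w))
    # Clamp to [-1, 1] to guard against floating-point round-off.
    return math.acos(max(-1.0, min(1.0, cos_theta)))


if __name__ == "__main__":
    v, w = (2, -5), (5, 2)
    print(magnitude((3, 4)))      # length of (3, 4)
    print(dot(v, w))              # zero, so v and w are perpendicular
    print(angle_between(v, w))    # approximately pi/2
```

The same functions work unchanged for vectors with three components.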


Vectors in $\mathbb{R}^3$ work just like vectors in $\mathbb{R}^2$, except that you now have three components,
instead of two. We also frequently refer to $\mathbb{R}^2$ as being contained in $\mathbb{R}^3$ by thinking of the point
or vector $(a,b,0)$ as being the same as the point or vector $(a,b)$, i.e., it is standard to think of
the xy-plane as being the same as the plane where $z = 0$ inside xyz-space.


Given an ordered triple of real numbers, like (2, —5,3), we may refer to (2, —5,3) as a point, 
and picture it as a point, or we may refer to it as a vector, and picture it as the vector represented 


by an arrow from the origin to the point (2, —5, 3). 


If we have a vector $\vec{v} = (a,b,c)$, we refer to $a$ as the x-component of $\vec{v}$, to $b$ as the y-component
of $\vec{v}$, and to $c$ as the z-component.

The magnitude, $|(a,b,c)|$, of a vector $(a,b,c)$ is the Euclidean length of the arrow from the
origin to the point $(a,b,c)$, i.e., $|(a,b,c)| = \sqrt{a^2 + b^2 + c^2}$.
There is only one vector in $\mathbb{R}^3$ whose magnitude is zero: the zero vector $\vec{0} = (0,0,0)$. A unit
vector means a vector whose magnitude is 1.


We define vector addition and scalar multiplication as you would expect: the sum of two 


vectors (a,b,c) and (d,e, f) is defined by adding the corresponding components: 


(a,b,c) + (d,e,f) = (a+d,b+e,c+ f). 


The scalar multiplication r(a, b,c) of the vector (a,b,c) by a real number (a scalar) r is defined 
by 


r(a,b,c) = (ra,rb,rc). 
If $\vec{v} \neq \vec{0}$, then there is a unique unit vector in the direction of $\vec{v}$, namely, $\frac{\vec{v}}{|\vec{v}|}$. As in the case
of "vectors" in $\mathbb{R}$ and $\mathbb{R}^2$, we say that the zero vector $\vec{0}$ has every direction.


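The dot product of two vectors $\vec{v} = (a,b,c)$ and $\vec{w} = (d,e,f)$ is defined by

\[ (a,b,c) \cdot (d,e,f) = ad + be + cf, \]

i.e., you multiply corresponding components and add. The dot product in $\mathbb{R}^3$ has the same nice
properties as it did in $\mathbb{R}^2$: for all vectors $\vec{u}$, $\vec{v}$, and $\vec{w}$, and scalars $a$ and $b$,

1. (commutativity) $\vec{v} \cdot \vec{w} = \vec{w} \cdot \vec{v}$;
2. (distributivity/linearity) $\vec{u} \cdot (a\vec{v} + b\vec{w}) = a(\vec{u} \cdot \vec{v}) + b(\vec{u} \cdot \vec{w})$; and
3. $\vec{v} \cdot \vec{v} = |\vec{v}|^2$.

In addition, it is still true in $\mathbb{R}^3$ that

\[ \vec{v} \cdot \vec{w} = |\vec{v}|\, |\vec{w}| \cos\theta, \]

where $\theta$ is the angle between the vectors $\vec{v}$ and $\vec{w}$.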


Vector-valued functions 


Suppose we have a function $\vec{p}(t) = (x(t), y(t))$ or $\vec{p}(t) = (x(t), y(t), z(t))$, whose domain is a
subset of the real numbers and whose codomain is a subset of either $\mathbb{R}^2$ or $\mathbb{R}^3$; this is a
vector-valued function. The functions $x(t)$, $y(t)$, and $z(t)$ are the component functions of $\vec{p}(t)$. Such
a function $\vec{p}(t)$ is continuous if and only if each of its component functions is continuous; such
a function is differentiable if and only if each of its component functions is differentiable and,
when this is the case, the derivative $\frac{d\vec{p}}{dt} = \vec{p}\,'(t)$ is defined by

\[ \vec{p}\,'(t) = \lim_{h\to 0} \frac{\vec{p}(t+h) - \vec{p}(t)}{h} = (x'(t), y'(t)) \ \text{ or } \ (x'(t), y'(t), z'(t)). \]

Differentiation of vector-valued functions is linear: if $\vec{p}(t)$ and $\vec{q}(t)$ are differentiable
vector-valued functions, both in $\mathbb{R}^n$ for the same $n$, and $a$ and $b$ are constants (scalars), then $a\,\vec{p}(t) + b\,\vec{q}(t)$
is differentiable and

\[ (a\,\vec{p}(t) + b\,\vec{q}(t))' = a\,\vec{p}\,'(t) + b\,\vec{q}\,'(t). \]
More surprising is that there is a product rule for the dot product: if $\vec{p}(t)$ and $\vec{q}(t)$ are
differentiable vector-valued functions, both in $\mathbb{R}^n$ for the same $n$, then $\vec{p}(t) \cdot \vec{q}(t)$ is differentiable
and

\[ (\vec{p}(t) \cdot \vec{q}(t))' = \vec{p}(t) \cdot \vec{q}\,'(t) + \vec{p}\,'(t) \cdot \vec{q}(t). \]

An immediate consequence of this is that, if $\vec{p}(t)$ is differentiable and always has magnitude 1
(i.e., is always a unit vector), then $1 = \vec{p}(t) \cdot \vec{p}(t)$ for all $t$; therefore, differentiating with respect
to $t$ and applying the Product Rule, we find

\[ 0 = 2\,\vec{p}(t) \cdot \vec{p}\,'(t), \]

so that $\vec{p}(t)$ is always perpendicular to $\vec{p}\,'(t)$.
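
This perpendicularity is easy to check numerically. The Python sketch below is an illustration only: it uses the unit-vector-valued function $\vec{p}(t) = (\cos t, \sin t)$, which is our choice of example, and approximates $\vec{p}\,'(t)$ component by component with a central difference quotient (an approximation of the limit, with a hypothetical step size `h`).

```python
import math


def p(t):
    """A vector-valued function that is always a unit vector."""
    return (math.cos(t), math.sin(t))


def derivative(f, t, h=1e-6):
    """Approximate f'(t) componentwise by a central difference quotient."""
    forward = f(t + h)
    backward = f(t - h)
    return tuple((a - b) / (2 * h) for a, b in zip(forward, backward))


def dot(v, w):
    """Multiply corresponding components and add."""
    return sum(a * b for a, b in zip(v, w))


if __name__ == "__main__":
    for t in (0.0, 0.7, 2.0):
        # Each dot product should be very close to zero.
        print(t, dot(p(t), derivative(p, t)))
```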


Motion in the plane and in space 


We will now discuss motion in the plane and in space. Throughout our discussion, we shall
use notation for the case of motion in $\mathbb{R}^3$. The case of motion in $\mathbb{R}^2$ is analogous; simply ignore
the z-component in what we write, or take the z-component to be zero.

Suppose that an object is moving in $\mathbb{R}^3$. Then, at each time $t$, the object has an x-coordinate,
$x(t)$, a y-coordinate, $y(t)$, and a z-coordinate, $z(t)$. The functions $x(t)$, $y(t)$, and $z(t)$ are said
to describe a parameterized curve in space. The position vector (or, simply, position), $\vec{p}(t)$, of
the object, at time $t$, is simply the vector $(x(t), y(t), z(t))$. In place of $x(t)$, $y(t)$, and $z(t)$, it is
common to write $p_x(t)$, $p_y(t)$, and $p_z(t)$, respectively, for the x-, y-, and z-components of $\vec{p}(t)$.


If the position function $\vec{p}(t)$ is differentiable, the velocity vector (or, simply, velocity), $\vec{v}(t)$,
of the object, at time $t$, is the derivative of the position vector, i.e.,

\[ \vec{v}(t) = \vec{p}\,'(t) = (x'(t), y'(t), z'(t)). \]

Figure A.2: Velocity vectors are tangent to the curve defined by $\vec{p}(t)$.

It is standard to indicate the velocity vector $\vec{v}(t)$ as having its initial point at the point $\vec{p}(t)$;
if the velocity vector $\vec{v}(t_0)$ is non-zero, it will be tangent to the curve defined by $\vec{p}(t)$ at the
point $\vec{p}(t_0)$.

The magnitude $|\vec{v}(t)|$ of the velocity is the speed of the object at time $t$. See Section 3.3
for why this notion of speed is equivalent to the instantaneous rate of change of the distance
traveled, with respect to time.

The acceleration vector (or, simply, acceleration), $\vec{a}(t)$, of the object, at time $t$, is the
derivative of the velocity vector, i.e.,

\[ \vec{a}(t) = \vec{v}\,'(t) = \vec{p}\,''(t), \]

provided that $\vec{v}(t)$ is differentiable.


Appendix B 


Tables of Integration Formulas 


In the tables below $a$, $b$, $p$, and $C$ denote arbitrary real constants, except that, in the formulas
involving $b^x$ and $\log_b$, we assume that $b > 0$, and in the formulas involving the inverse trig
functions, we assume that $a > 0$. We use $f$ and $g$ to denote differentiable real functions, and $u$,
$v$, and $x$ to denote variable names, either independent variables, or dependent variables, i.e., the
values of functions. In any formulas involving divisions, we assume that we are in a situation
that does not lead to division by zero. More generally, we assume that we are in a situation
where all of the functions involved in the formulas are defined.


Formulas for Reducing Complicated Integrals to Easier Ones 


Linearity: $(af + bg)'(x) = af'(x) + bg'(x)$.
Integration by Parts: $(f \cdot g)'(x) = f(x)g'(x) + g(x)f'(x)$.
Integration by Substitution: $(f \circ g)'(x) = f'(g(x)) \cdot g'(x)$ or $\frac{dv}{dx} = \frac{dv}{du} \cdot \frac{du}{dx}$.


Algebraic Integrals 


x bY 


$\int x^p \, dx = \frac{x^{p+1}}{p+1} + C$, if $p \neq -1$.    $\int \frac{1}{x} \, dx = \ln|x| + C$.

$\int e^x \, dx = e^x + C$.


Trigonometric Integrals 


$\int \cos x \, dx = \sin x + C$.    $\int \sin x \, dx = -\cos x + C$.
$\int \sec^2 x \, dx = \tan x + C$.    $\int \csc^2 x \, dx = -\cot x + C$.
$\int \sec x \tan x \, dx = \sec x + C$.    $\int \csc x \cot x \, dx = -\csc x + C$.


Inverse Circular and Hyperbolic Trigonometric Integrals 


1 x 
dg = sin ( Oo) EC; a 0, 

-_ a 

en x2 ( ) 


1 1 : 
loam => ~ tan” (F)+<, ifa 40. 


1 1 ey te : 
lwe® 7 8c (F)+¢, ifa #0. 


1 - I x . 
l= = sinh (F)+¢, ifa>0. 


1 
lw" = cosh~ + (F)+¢, ifx>a>0. 
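Iteration Formulas for an integer $n \ge 2$

\[ \int \sin^n\theta \, d\theta = -\frac{1}{n} \sin^{n-1}\theta \cos\theta + \frac{n-1}{n} \int \sin^{n-2}\theta \, d\theta. \]

\[ \int \cos^n\theta \, d\theta = \frac{1}{n} \cos^{n-1}\theta \sin\theta + \frac{n-1}{n} \int \cos^{n-2}\theta \, d\theta. \]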
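
As a sanity check on the sine iteration formula, consider the definite integral from $0$ to $\pi/2$. There the term $-\frac{1}{n}\sin^{n-1}\theta\cos\theta$ vanishes at both endpoints for $n \ge 2$, so the formula reduces to $I_n = \frac{n-1}{n} I_{n-2}$. The Python sketch below, whose function names and choice of interval are our own, compares this recursion with a simple midpoint-rule approximation.

```python
import math


def sin_power_integral(n):
    """Integral of sin^n(theta) over [0, pi/2] via the iteration formula.

    On this interval the boundary term vanishes for n >= 2, leaving
    I_n = (n - 1)/n * I_{n-2}, with I_0 = pi/2 and I_1 = 1.
    """
    if n == 0:
        return math.pi / 2
    if n == 1:
        return 1.0
    return (n - 1) / n * sin_power_integral(n - 2)


def midpoint_rule(f, a, b, steps=100000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    width = (b - a) / steps
    return width * sum(f(a + (i + 0.5) * width) for i in range(steps))


if __name__ == "__main__":
    for n in range(2, 7):
        exact = sin_power_integral(n)
        approx = midpoint_rule(lambda t: math.sin(t) ** n, 0.0, math.pi / 2)
        print(n, exact, approx)
```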


Appendix C 


Answers to Odd-Numbered 
Exercises 


For producing answers to various exercises or for help with examples or visualization, you may
also find the free web site wolframalpha.com very useful.


Chapter 1 


Section 1.1 


. 32° + 20? +9n+C 

. —5cost —3sin"'t+C 

_Injy|+tan-+y+C 

. 1/2sin(20—-1)+C 

.1/2In|r? — 4, +C 

. (5w — 3)'°'/505+C 

. —(a + 2)(a + 5) + (w@ + 7)(x + 2) nlx + 2/ 4+ C 
. 5Inflna|+C 


. —In|cos 6| + C 
5 3/2 
20 +637+C 
.sin’'(2—3)+C 
. P(w) = 5 In|w| — Ze” + (9/2)w*/3 + 7e7! — 9/2 
_K(v) = —v71 4+ Info| — 2u7 7? 4-1 


/3a3 — xcosa@+sing +7 — 1/373 


1 
= 2/3(@ +1)? — 2/5(@+1)9/?7 +17/6 


. R(t) = -1/2e!-” - 1/o? +241 
. (x —5)?e” — 2[ (a — 5)e” —e"] +C = 


(a? — 12@ + 87)e7+C 
35. t? sint + 2tcost — 2sint +C 


Sie er 
39. —e?*[2sin(5x) — 5cos(52)] + C 


41. 5 Cu tan 'w+tan'w—w)+C 


43. 
a. p(t) = —5t cos(2t) + 2.5 sin(2t) + 20 
b. p(4) & 25.3833963 


45. p(t) = t? — 247 +3t4+2 
47. p(t) = 2/21(18 + 7t?)9/? — 82/21 
49. 


a. —sinxcosxz + f cosxcosxdx 
b. —sinacos#/2+2/2+C. 

c. sinacosx/2+4+2/2+C 

d. (sin(2x))/4+2/2+C 


57. (tan ~'(a/a))/a+C. 
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61.e6 +C 
63. 25+C 
65. aIn(1 +a?) —2a+2tan7!(x) +C 
67. (1 — cost)/12 


69. (In(t? + 1))/6 


gett 
nt+l1 


gnti 


a. ft" ntdt = Coane 


b. (Int)?/2+C 


Int tC 


77. 


maximum speed is 0.5 mi/h 

a(t) = 1/2sint — 1/6sin* t. 

y(t) = (x(t)? = (Zsint — } sin? ¢)? 
. O mi/h 


Bo op 


79. v= /24t4 v0, p= t?/64 47/2 + vot + po 


81. v = —e~*/3 + v9 1/3, p = e*/9 + (vo + 
1/3)t + po — 1/9 


Section 1.2 


1. 7 In| sec(38)| +C. 


~ 


1 1 
Z In| sec(4y) + tan(4y)| + 3! sec(3y)| + C. 


5. In|sin(sinx)|+C. 


oa 


1 
—$ In| eset + cot — 5 csctcott + C. 


9. 5(v25 Py? 40. 


k 1. 


11. 
242(121 +k?) " 2662 


tan” '(k/11)+C. 


? sin™'(7v/V10) + C. 


2 

2(5+527) 3, 4 
15. — 4+ -=-t 2 
5 (a z+C 


17. —yv—cotv+C. 
Da 
19. /e2* — 16 —4tan™! we AG: 
21. Argument is similar to the iterated cos @ for- 
mula, which is derived in the text. 
cos(ay + by) 
a+b 


1 | cos(ay — by) 
2 a—b 


23. +C. 


25. 22 and 24 become integrals of sin? (az) 
and cos?(bx), resp. 23 is just the integral of 
cos(ay) sin(ay). 

27. Idea: use integration by parts. 

29. 2ycosy + (y? — 2)siny+C. 

31. (32? — 6) cosz + (z? — 6z)sinz+C. 


38y 61 3). 
33. =o qon w cosy — T6 sin2y+C. 
ae) 
35. w ane | = _ +C. Answers may 
vary based on how 1 + tan? u = sec? wu is applied. 
sin3de  sindx 
37. a + “10 +C, 


39. p(t) = -3(16 — £7)°/? (32 + 387) + 3372.8. 


Al. p(t) = (10¢? — 180)/#? + 9 + 540. 


cos5a% cos3x cosxz 
43. 30 78 3 C. 


45. Liem? y + C or Dae tC. 
2 2 
9-— +? 


47. V44+a2 = ; 
49, J(1—27)3/? = (2 — 18) /22 +940. 


sin *(t/3) + C. 


Section 1.3 
1. 4In|a+ 2) + 7ln|a2+3/+C. 
3. 5Injul +3ln|ju—7|4+C. 


5. ; In |2z 4 


1]+3In|z—1]4+C. 
1 2 
7. =p’ —2p4+9ln|p+3|/+C. 


2 
a +424 


2 
: 3lnja+5/+C 
9 T T . 


11. 5ln|m4 
13. In|v? 
15. 2(In|2¢ 4 


3} +n|jm?+m41/+C. 
4v + 18] —5In|ju—6|+C. 
3| + In|? + 5t + 30|) + C. 


17. —3n|y +12] — 5 tan (4) Lay? 
4y + 20| + C. 


19. = In| + o+5 
6In|¢+2|+C. 
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21. Inju+3|]/+4In|u— 2| + C. 
u— 2 
2 
23. Injx+3| 4In |x + 2)+C. 
r+3 
25. 3ln|r—1| - : + 4In|r+3|/+C 
, r—1 (r—1)2' ‘ 

A fal x ) C za x ) 
27. tan tan —]4+K. 

jp \ye)* D VD 
29. y(a) = 2? +a —In|x —5|+In|a +3] —In9. 
31. tan”? (4) + 3In|y| = 3ln|a#+5|-—4In|x4 
6) +C. 


33. See the ”Separable Differential Equations” 
chapter of Worldwide Differential Calculus for a 
complete proof. 


35. 2/x+3ln|Je— 3) —3ln|/Vz4 3) +C. 
37. 3( Ye 4 sans Linley, 


39. 2/3+2+4tanh7! Wr +3+C. 


12(1—e* 


43. Approximately 1.552 mols per unit volume. 
45. 


a. a = —5, 6 = 1 is trivial. Use the product 
rule to calculate Q’(z). 


b. Substitute the variables into the formula in 
the explanatory paragraph. 


7. sinh~'(x?/3) + C or In(2z? + Va? +9) 4+. 
9. Yiu t ea(ee ade 8) AO: 
ie; 227? + In(t+ Vt24+3)+C. 


13. sinh” *(2/3) + cosh™*(2/2) + C or In(x + 
z?+9)+In(a+ Vx? —4)+C. 
15. 


a. (cosh? x)/2+ Cy = (sinh? x) /2 + Co. 


b. Same as (a). 


The two answers coincide. 


17. Prove sinh(—x) = — sinh(x) by using the def- 
inition of sinha in terms of the exponential func- 
tion. 


19. sinh(x) is one-to-one since its derivative, 
coshz, is strictly positive. The formula for 
sinh~’ x gives an explicit inverse formula with do- 
main equal to all real numbers. 


21. sinh z is strictly increasing since its derivative, 
cosh a, is strictly positive. The second part follows 
immediately from the fact that sinh0 = 0. 


23. Prove this by differentiating the definition of 
sinha using what you know about the derivative 
of the exponential function. 


25. Prove this by writing sinha and cosh in 
terms of the exponential function. 


c. Evaluate the limit via direct substitution. 27. 5 (22 1)Vax2—-1 ; cosh ta+C. 
47. A=1,B=-2.5,C =2.5. : 
29. © (20? + a?)s/2? + a2 — © sinh! (a/a) +C. 
49. —1. ° 8 
31. -5 cosh~* (52) + eaey 2522-1 — 
1 4/25 —-14+C. 
Section 1.4 8 


Many of the integrals have equivalent but 
different looking answers depending on 
whether the are expressed in terms of hy- 
perbolic trig functions or logarithms. We 
indicate both answers for some but not all 
of the problems. 


1. 7sinh~!(2/3) +C or Tin(x#+ Va? +9) +C 
3. 5 sinh” *(2y/7)+C or In(2y+V/4y? + 49)+C. 


5. = sinh(3x) +C. 


33. aps oo (26°2” +a?) b2x? + a? 
—a‘ In(bx + \/b22? + a?)] + K 


35. v(t) = sinh! ((t—3)/V7) +8—sinh7!(3/V7). 
37. v(t) =sinh(¢t? + 2) + 4 — sinh(2). 

39. xcosh7! ax — Vx? —14+C. 

41. p(t) =tcosh7'(t/4) — Vt? —16 + Cot + Ch. 


3t 2 
V9t? —114+(8 
aa) 3 ( 


43. p(t) = 2tcosh! ( 
Inll)t+C. 
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ms Te Se a aa 37. (ti — B)(Y — 9) = Leary — FM = 


47. sinh~'a—cosh-!t=C. 


49. 5 (sin cosh +cosxsinhz)+C. 


Section 2.1 

85. 

In 120 & 4.787. 
30. 


(oN See 


11. 63. 

1s: 

15. $\Delta k^2 = k^2 - (k-1)^2 = 2k - 1$.
17. 8.

19. $6k - 7$.

a. A,

23.

$(\Delta f)(x) = \sin x - \sin(x - 1)$

$= \sin x - (\sin x\cos 1 - \sin 1\cos x)$

$= (\sin x)(1 - \cos 1) + \sin 1\cos x$.
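The identity in answer 23 can be spot-checked numerically. The sketch below assumes the backward-difference convention $(\Delta f)(x) = f(x) - f(x-1)$, which is what the answers in this section appear to use; the function names are illustrative only.

```python
import math

def backward_difference(f, x):
    # (Δf)(x) = f(x) - f(x - 1); assumed convention, inferred from the answers above.
    return f(x) - f(x - 1)

def closed_form(x):
    # Right-hand side of answer 23: (sin x)(1 - cos 1) + sin 1 cos x.
    return math.sin(x) * (1 - math.cos(1)) + math.sin(1) * math.cos(x)

if __name__ == "__main__":
    for x in (0.0, 0.5, 2.0, -3.7):
        print(x, backward_difference(math.sin, x), closed_form(x))
```

The two columns should agree to within floating-point rounding for every x.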


25. 
a. 1 
b. 0. 
27. 100/101. 
29. 1. 
31. 
Si(e@-2)=S>2-Soz 
=)oa-nz 
=} e- 5 2-0 
33. sx 4.444. 


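35. $\sum (x_i - \bar{x})^2 = \sum x_i^2 - 2\bar{x}\sum x_i + \sum \bar{x}^2 = \sum x_i^2 - (2/n)\sum x_i \sum x_i + (n/n^2)\left(\sum x_i\right)^2 = \sum x_i^2 - (1/n)\left(\sum x_i\right)^2$. Divide by $n - 1$ to get the final result.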
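As a quick sanity check of the identity in answer 35, the following sketch compares the sum of squared deviations with the shortcut form on a small data set. The data and function names are made up for illustration.

```python
def sum_squared_deviations(xs):
    # Direct form: sum of (x_i - mean)^2.
    n = len(xs)
    mean = sum(xs) / n
    return sum((x - mean) ** 2 for x in xs)

def shortcut_form(xs):
    # Shortcut form: sum of x_i^2 minus (1/n) * (sum of x_i)^2.
    n = len(xs)
    return sum(x * x for x in xs) - sum(xs) ** 2 / n

def sample_variance(xs):
    # Dividing by n - 1 gives the sample variance.
    return shortcut_form(xs) / (len(xs) - 1)

if __name__ == "__main__":
    data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical sample
    print(sum_squared_deviations(data), shortcut_form(data), sample_variance(data))
```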


37. $\sum (x_i - \bar{x})(y_i - \bar{y}) = \sum x_i y_i - (2/n)\sum x_i \sum y_i + (n/n^2)\sum x_i \sum y_i = \sum x_i y_i - (1/n)\sum x_i \sum y_i$.
39. f(m) = m4. 


“ i(i+1) 
41. F = —_———. 
(m) a 
43. The number of gifts received on the m-th day 
is the m-th triangular number. The cumulative 
number of gifts received by the end of the m-th 


day is the m-th tetrahedral number. 
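The claim in answer 43 is easy to confirm by brute force. This sketch assumes the usual setup in which day m brings 1 + 2 + ... + m gifts; the function names are illustrative.

```python
def triangular(m):
    # m-th triangular number: 1 + 2 + ... + m.
    return m * (m + 1) // 2

def tetrahedral(m):
    # m-th tetrahedral number: sum of the first m triangular numbers.
    return m * (m + 1) * (m + 2) // 6

def cumulative_gifts(m):
    # Brute-force total of gifts received through day m.
    return sum(sum(range(1, day + 1)) for day in range(1, m + 1))

if __name__ == "__main__":
    for m in (1, 2, 3, 12):
        print(m, triangular(m), tetrahedral(m), cumulative_gifts(m))
```

For m = 12 the cumulative total is 364.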
Iwona 
45. 5 du +1). 


47. Prove by induction on n. 


49. Prove by induction on n. 


Section 2.2 
1. m(P) =3. 

3. m(P) =1/4. 
5. m(P) = 1/2”. 
7. False. 

9. True.

11. False. 

13. False. 

15. 


a. Yes. 
b. Yes. 


17. 3250 meters. 
19. 13/12 miles per hour. 


21. Outline: the Riemann sum is telescoping. The 
total displacement of a particle moving with con- 
stant velocity over an interval is equal to the ve- 
locity times the width of the interval (= the total 
time traveled). 


23. Follows from the fact that if $s_i$ is a sample
point, then $m \le f(s_i) \le M$.


25. 3. 

27. 0.285. 

29. = 0.27683. 
31. —7. 
33. ≈ 3.92923.
35.

a. 1.

b. ≈ 1.366.

c. ≈ 1.459.

d. π/2 ≈ 1.571.


S 


37. 15/32 = 0.46875. 
39. 15/8 = 1.875. 


41. Idea: Riemann sum linearity follows from the 
linearity of finite summations. 


43. 
100 + 3(2"** -1 
a. RB, (f) = ° ). 
‘ 100 
b. Rewrite the answer to (a) as Cty +6: 


2” — 0.5 . 
a The first term vanishes as n — oo. 


The second term approaches 6. 


45. Follows from the fact that f evaluated at the 
representative from U(P) is greater than f evalu- 
ated at the representative from L(P) on each in- 
terval of the partition. 


47. Let QO be a common refinement of P and 
P’. Then by parts (a) and (b), pe) 2 RE) e 
pclae < Ree, 

49. = 3.79. 

51. & 4.1365. 


Section 2.3 
1. 20. 

3. 64. 

5. 2880. 

7. —55/6. 
0. 

11. False. 


so 


Eg: fle) =, g(e) = 


17. 5.5. 


i 1 
19. ar + roti + ho. 


21. Answers may vary. For example, on the interval
[0, 1], take $f(x) = x$ if $x \ne 0.5$ and $f(1/2) = 10$.


23. 1/3 meters per second. 
25. 
27. 


1 meter per second. 


a. a(t) = 2t—5. 


b. 3 meters per second per second. 


29.
a. $a(t) = -1$ for $t \in (2, 4)$ and $a(t) = 1$ for $t \in (4, 6)$;
$a(t)$ is undefined at $t = 4$.


b. The average value of the acceleration func- 
tion is undefined; however, average acceler- 
ation, as the change in velocity divided by 
the change in time, is 0. 


31. 


a. The avg. velocity of the second particle 
equals the avg. velocity of the first particle 
plus C. 


b. The avg. accelerations are equal. 


33. Yes, since the function is continuous. 


35. = 0.4987. 
37. & 2.40. 
39. 40 + 237. 
41. 152. 

43. 1/6. 

45. 


a. Use the point-slope equation for a line. 


b. Integrate carefully. Looks complicated but 
many items are just constants. 


10 
47. / (12 — x)ra? de. 
(0) 


49. 5717. 
51. wR. 


Section 2.4 
1. 148. 
3. 0. 
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5. 24— 253 ed! + 50sin~! 3 Section 2.5 
| ; ‘ 2 1. 1/10 
7. (2+)/108. : 
9. In72 3. Diverges. 
. M72. 
5. Diverges. 
=7 = _ +4,/11 
11. cosh“? (#8) ~ cosh“? (3) = In (285542). | 
7. Diverges. 
97,7 
ne se 9. 1/2. 
15. 9In (3) —2. ll. —4. 
17. 2a f(a?) 13. Diverges. 
1 1 15. Diverges. 
19. 5 (v2+sinh* 1). 
2 aaa 17. Converges to 1/(1— 7). 
21. h(t) : 19. Converges. 
23. 18+2ln7 21. Converges. 
25. 0.4 23. Converges. 
27 27. Diverges for all p. 
29. Both are equal to 1. 
a. c= 1/21. 
: 31. 1/A. 
b. 19/63. 
33. 1/d?. 
a 35. wr +o. 
31. 329/6. a 
33. 0.2. — 


i 2 7 
S8: MaMp iy outs (oa) tector and emp iy: 41. Idea: take the derivative of @ given in the 


37 (b- ay previous problem. 
i 43. 

39. Follow the hint. 

41. 272. a. & 63.2%. 

43. 22-1 b. = 24.6%. 

45. 5e® — e?. ate aes 

47. 45. % 12.8 atoms. 


47. See solution manual for detailed answer. 
a. Make the substitution y = (7/2) — x. 


b. Use the fact that cos and sin are co- au: Ne: 
functions. 
c. The integrand of A+ B is 1 so the integral F 
of A+ B is 7/2. Section 2.6 
1. 
49. 
a. 20 + 35In a or 20 — 70 ere x —9.65543. 
a. 7. 7 5 
T. b. & —9.52603. 
c. & —9.91667. 


d. © —9.66667. 
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ST 2 


o 


11. 


13. 


a oO 


15. 


a. 4.5 (2/5 + In(2 + V5)) = 26.62097. 
b. 


253125 
16 
~ 9.49344. 


= 10.04141. 
= 9.73300. 


In ~ 9.66904. 


16+ 1n — & 24.13235. 


= 23.98561. 
= 24.44286. 
~ 24.18413. 


In 2/2 © 0.34657. 
= 0.34248. 
~ 0.35491. 
= 0.34726. 


-= + In 25 & —7.11446. 


—7.23043. 
—6.87083. 
—7.07685. 


2 


2 


2 


241m ($=) = 7.07470. 


124 3V15 
~ 7.07118. 


= 7.08175. 
= 7.07474. 


In ee = 1.65202. 


~ 1.64581. 
~ 1.66453. 
~ 1.65243. 


& 26.53705. 


c. & 26.78876. 

d. & 26.62160. 
17. 18 intervals. 
19. 2 intervals. 
21. 

a. © 1.80972. 


b. & 1.74036. 
c. & 1.78867. 


23. 


a. & 0.92733. 

b. & 0.92645. 

c. = 0.92704. 
25. & 0.69377. 
27. 7 & 3.139926. 


29. 
128 
3n? 
256 
3n? 
31. 
72e!? 
a 35 
144e? 
b. m2 
5913e° 
33. 
5n4 
35. Outline: Ax = (b — a)/4, let 
f(z) = «*. The Simpson’s approximation is 
Aa 


(f(a) +4f(a+ Aa) + ... + f(b)). Write every- 


thing out in terms of a and b. 


37. 


a. & 0.34198. 
b. & 0.68397. 


39. The question is equivalent to the problem:
What proportion of the data is between −1 and 1
standard deviations from the mean? We approximated
this in problem 37(b): ≈ 0.68397.
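For comparison, here is a minimal sketch of Simpson's rule applied to the standard normal density on [−1, 1]. The number of subintervals used in the exercise is not stated in this answer key, so `n = 100` below is an assumption; a coarser subdivision would give a less accurate figure such as the 0.68397 quoted above.

```python
import math

def simpson(f, a, b, n):
    # Composite Simpson's rule with n (even) subintervals.
    if n % 2 != 0:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def std_normal_pdf(x):
    # Density of the standard normal distribution.
    return math.exp(-x * x / 2) / math.sqrt(2 * math.pi)

if __name__ == "__main__":
    approx = simpson(std_normal_pdf, -1.0, 1.0, 100)  # n = 100 is an assumed choice
    exact = math.erf(1 / math.sqrt(2))                # closed form via the error function
    print(approx, exact)
```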


41. 51 Joules. 


43. Since t is given in seconds and the velocity in
miles per hour, one of the two must be converted.
Answer: ≈ 0.25611 miles.
45. 1.591517. Turns out the limiting integral is
equal to the interesting number 7/e. 


47. = 4.20156 seconds. 
49. If $A$ is small, then $\sin^2(A/2) \approx 0$. The integral


/ 1 
i tly 274 / —. 
is exactly 27 4/ o9 


Section 3.1 
1. 1510 m. 
Ef aim ~ 1,089, 672 m. 


5. a Ge + v3) ~ 0.3119 m. 


tes “ ~ 0.2146 m. 


2 
9. Tc tan + : ~ 0.1168 m. 


11. 9.5 m. 
13. 1.0 m. 


2 


15. cos4sin4 + : (82 + 3sinh4 + sinh 12) 


54, 361 m. Note that there are equivalent algebraic 
expressions for this integral. 


17. 8V3(-1+ V2) + 5.74 m. 
19. —1+In8 = 1.08 m. 
21. 


a. 7/6 m/sec. 


b. 2/2 m/sec. 


a. (—4/3) In2 m/sec. 
b. 4In5 — 4In3 — (4/3) In2 m/sec. 


25. 
dhe ts . 

a. {p (sinh 16 — sinh4) m/sec. Answers may 
also be written in terms of the exponential 
function. 

b. 35 (sinh 16 + sinh 4) m/sec. 

1 2 2 


1 


29. p(t) = —tan~ t+ 5 n(1 +42) +547 — 5 m2. 


5 


= ty t i t i 1 j 
31. d(t) l+e 4 stat q sin 2t. 
Te 
. a(t) = 18t —- — —- —. 
33 (t) 8 5 3 
35. The distance function is continuous and dif- 
ferentiable at t = 3. 
37. 
—9.8t t€ [0 
20S ele] 
29.44 2(t—3) te [3,8]. 
b. -166.1 m. 


c. -20.7625 m/sec. 


39. Statement is false. Answers may vary, but
one easy counterexample is $v(t) = t$ on the interval
$[-1, 1]$.


43. 4+ 2 —In5 m/sec. 


45. 1 meter. 


47. 


a. $|v(t)| = |e^{-t}||\sin t| \le |e^{-t}| \to 0$.


b. Outline: first, note that v(t) doesn’t change 
sign on such an interval. Use repeated inte- 
gration by parts to calculate the integral. 


c. This integral is the sum of finitely many
terms of the form given in part (b). Thus,

$$\int_0^{n\pi} |v(t)|\,dt = \frac{1 + e^{-\pi}}{2}\sum_{k=1}^{n} e^{-(k-1)\pi}.$$

d. Since $e^{-\pi} < 1$, the series above is geometric.
It converges to

$$\frac{1 + e^{-\pi}}{2(1 - e^{-\pi})}.$$
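A numerical check of the limit in part (d), assuming $v(t) = e^{-t}\sin t$ as in part (a): the sketch integrates $e^{-t}|\sin t|$ over successive intervals of length $\pi$ and compares the total with $(1 + e^{-\pi})/(2(1 - e^{-\pi}))$. The Simpson helper and the cutoff of 20 intervals are choices made here, not taken from the text.

```python
import math

def simpson(f, a, b, n=200):
    # Composite Simpson's rule with n (even) subintervals.
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def abs_velocity(t):
    # |v(t)| with the assumed velocity v(t) = e^{-t} sin t.
    return math.exp(-t) * abs(math.sin(t))

def distance_on_piece(k):
    # Distance traveled on [(k-1)π, kπ], where v(t) does not change sign.
    return simpson(abs_velocity, (k - 1) * math.pi, k * math.pi)

def geometric_limit():
    # Value of the geometric series: (1 + e^{-π}) / (2 (1 - e^{-π})).
    q = math.exp(-math.pi)
    return (1 + q) / (2 * (1 - q))

if __name__ == "__main__":
    partial = sum(distance_on_piece(k) for k in range(1, 21))
    print(partial, geometric_limit())
```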


49. 


a. Only thing to check is that the helix rises by 
34 angstroms per revolution. Note that an 
increase in t by 27 leads to an increase in the 
z coordinate of 34 angstroms. 


b. $a'(t) = (-10\sin t,\ 10\cos t,\ 17/\pi)$.


d. $2\pi\sqrt{10 + 289/\pi^2} \approx 39.38$ angstroms.


e. & 1.122 meters. 
1.
3. 


33 


4 
110/3 
e-1l 
4 

133/12 

36 

125/6 

(e—2)/2 

2(m — 2) 

34tan | 4— 8 = 37. 

500. 

5 

783/2. 

51n5. 

636/5. 

2 (cosh 10 + sinh7? 1 — V2). 

10 (sinh~* 10 — cosh~* 10) — sinh~* 1+ V2 + 


3vV11— Vv 101. 


35. 
37. 


1/2. 
Follows from the fact that $|(f+h) - (g+h)| = |f - g|$.


41. 


Area = = (-(U—1)+ UU). Area — 00 as 


Uo. 


43. 
45. 
A7. 
49. 
dl. 
53. 


A/2 


1 
Z (a a e(1/8)" the tg ual 


& 0.39133. 
c=1/2. 
c=6. 


a. $f$ continuous implies $|f|$ is continuous, which implies
$|f|$ is integrable.


b. If the integral is positive, take k = 1. If the 
integral is not positive, take k = —1. 


c. $\left|\int_a^b f(x)\,dx\right| = k\int_a^b f(x)\,dx = \int_a^b k f(x)\,dx \le \int_a^b |f(x)|\,dx$. The last
inequality follows from the fact that $k f(x) \le |f(x)|$.


b 


d. By the argument in (c), fe@ - 


a 


[ terae— foo) ae 


g(a)| da > 


55. Follows from the inequality $\big||x| - |y|\big| \le |x - y|$.
Specifically, $|j - h| \le \big||f| - |g|\big| \le |f - g|$.


57. 7/12. 
59. arg. This is a circle. 
61. = (3v3-7) 
gg, 22-0 
2 
65. 1(R3 — R?) 
67. Analyst B 
69. 
B 
A 
C 
a. 
In the picture, the horizontal line is y = b, 
the vertical line is x = a. So ab is the area 
A+C whereas the right-hand side of the 
equation has area A+ B+ C. This proves 
the inequality. 
b. The inequality is an equality if $f(a) = b$.
71. 
a. $f(x)g(x) \le \frac{f(x)^p}{p} + \frac{g(x)^q}{q}$ holds point-wise.
Integrate point-wise and use the "additional
properties" to arrive at the result.
b. Set $h(x) = f(x)\Big/\left(\int_a^b f(x)^p\,dx\right)^{1/p}$. Define
$j(x)$ similarly using $g(x)$ and $q$. Then the
"additional properties" hold for $j$ and $h$. By
part (a), $\int_a^b j(x)h(x)\,dx \le 1$. Multiply both
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sides of the equation by the denominators of 35. 8r. 
h and j to arrive at the inequality. 37. p'() = ra(ritra) 
(cos rab _ ogg (ritr2)y sin —— + sin Citra) ) 
TL Td: ? 
Section 3.3 39. Idea: cusps occur whenever w = 2n7, n an in- 
(0, 0,8) teger. The cusps therefore occur, in terms of 0, at 


1 
3. (110, —90, 220). 

5. (9,9,9). 

7. No, the derivative is the zero vector when t = 0. 
9. Yes. The map avoids the cusp at t = —2. 

1 

t 


1. No, the derivative is the zero vector when 


6s 5s s 
13. w(s 34 P sf 
ns V62 V62 Je 
dg 

15. Follows from the fact that ae | 1 for all 

[a, bj. 

r/2 
17. ——, 
i 4 


19. 2(-1+ (1+ 7)*”?). 
21. In(1+ 2). 


93. 5V2— 96+ 5in VY 8 


1+ 2 
ee a 
6a 4 
2: p(s) = 
,0cos ae? bsin 


; (acos 2s452,48in oot pa, 
wap) 
29. It is not regular as it fails to be differentiable 


at t = 0. It is not rectifiable since it is not one-to- 
one on any open interval containing zero. 


31. 
a. Let $p(t) = R(\sin t, 0, \cos t)$ for $t \in [0, \pi/2]$,
$R(\cos(t - \frac{\pi}{2}), \sin(t - \frac{\pi}{2}), 0)$ for $t \in [\pi/2, \pi]$,
and $R(0, -\cos t, -\sin t)$ for $t \in [\pi, 3\pi/2]$.

b. $3\pi R/2$.
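The path in answer 31 can be checked numerically. The sketch below assumes the three pieces are three quarter great circles, with the last piece running over $[\pi, 3\pi/2]$ (the final endpoint is cut off in this copy of the key, so that endpoint is an inference). It confirms that the pieces join up, stay on the sphere, and have total length close to $3\pi R/2$.

```python
import math

def p(t, R=1.0):
    # Piecewise path on the sphere of radius R (last interval assumed to be [π, 3π/2]).
    if t <= math.pi / 2:
        return (R * math.sin(t), 0.0, R * math.cos(t))
    if t <= math.pi:
        u = t - math.pi / 2
        return (R * math.cos(u), R * math.sin(u), 0.0)
    return (0.0, -R * math.cos(t), -R * math.sin(t))

def polyline_length(path, a, b, steps=30000):
    # Approximate arc length by summing chord lengths between sample points.
    total = 0.0
    prev = path(a)
    for i in range(1, steps + 1):
        cur = path(a + (b - a) * i / steps)
        total += math.dist(prev, cur)
        prev = cur
    return total

if __name__ == "__main__":
    print(polyline_length(p, 0.0, 3 * math.pi / 2), 3 * math.pi / 2)
```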
33. 


a. Apply the chain rule to ¢(t) = w(f(t)). 


b. Use part (a) and the Substitution Rule. 
Also, since (a) = 4(c), and all the functions 


U 
are one-to-one, it must be that a! > 0. 


$\theta = (r_2/r_1)(2n\pi)$. When $r_2/r_1$ is rational, cusps
will start repeating themselves at a frequency
controlled by $r_1$. However, if $r_2/r_1$ is irrational, then
for an arbitrary $\theta$ we can find an $n$ such that $(r_2/r_1)2n\pi$
is arbitrarily close to $\theta$.


Al. 2’ (0) = (ra—11) sin6+(r2—11) sin (ms ae 
y (0) = (r1 — r2) cos 6 + (r2 — 71) cos (ma ae 
43. 

a. Let f(t) = e(t)-v. Then f’(t) =é’-v. Use 


this fact and Fundamental Theorem of Cal- 
culus to obtain the result. 


b. Use the hint and the fact that @ is a unit 
vector. Le., |o] = 1. 

b 
zh t)| dt > pe: ae = 

~~ q): (p —4q) — |p q\. 

Ip — | 
45. 

a. $\frac{dx}{d\theta} = r'(\theta)\cos\theta - r(\theta)\sin\theta$, $\frac{dy}{d\theta} = r'(\theta)\sin\theta + r(\theta)\cos\theta$.


b. Follows from the fact that $x'(\theta)^2 + y'(\theta)^2 = r(\theta)^2 + r'(\theta)^2$.
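To see the formulas of answer 45 in action, the sketch below uses the cardioid $r(\theta) = 1 + \cos\theta$ as an illustrative curve (not necessarily the one in the exercise). It checks that the Cartesian speed matches $\sqrt{r^2 + r'^2}$ and integrates that speed with Simpson's rule; the cardioid's length is known to be 8.

```python
import math

def simpson(f, a, b, n):
    # Composite Simpson's rule with n (even) subintervals.
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def r(theta):
    # Illustrative polar curve: the cardioid r = 1 + cos(theta).
    return 1 + math.cos(theta)

def dr(theta):
    # Derivative r'(theta).
    return -math.sin(theta)

def speed_cartesian(theta):
    # sqrt(x'(θ)^2 + y'(θ)^2) using the formulas from part (a).
    dx = dr(theta) * math.cos(theta) - r(theta) * math.sin(theta)
    dy = dr(theta) * math.sin(theta) + r(theta) * math.cos(theta)
    return math.hypot(dx, dy)

def speed_polar(theta):
    # sqrt(r(θ)^2 + r'(θ)^2) from part (b).
    return math.sqrt(r(theta) ** 2 + dr(theta) ** 2)

if __name__ == "__main__":
    print(simpson(speed_polar, 0.0, 2 * math.pi, 2000))
```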


47. $2\pi C$. This is a circle.


49. $\pi\sqrt{1 + 4\pi^2} + \frac{1}{2}\sinh^{-1}(2\pi)$.


Section 3.4 
1. 


637 


a. (1/2)(—8 + 23 + In 27). 
b. a =Iny. 


7. 37/2. 
9. 337. 


11. 


13. 


15. 


Section 3.5 


1. 
3. 
5. 


31. 
33. 


58. 


1/8. 


15741 /15. 
w/5. 

ug 
2n+1° 
m(16 — 77) 
ee 
69297/16. 
3V30 

16 
~ 6.45. 


~ 1.1063. 
~ 22.4981. 
1 1 
2a | —~ — ——~ ]. 
’ (S 5) 


B/3. 


The integral is finite by comparison with 


Gabriel’s Horn. 


35. 
37. 
39. 
Al. 
43. 
45. 


~ 5.23016. 
Qn? Rr’. 

Qn. 

47507 /3 cm?. 
= 27.672 cm. 


27(1—a). The integral converges to 27. 


3¢(16/3) 
10 
49. 1°(R? — r)/3. 
51. a. 
53. 


47. Vo 


a. 7. 
b. The integral converges to 27. 
c. The integral converges to 27. 
55. The volumes of a right circular cone, hemisphere,
cylinder and sphere are $\pi r^2h/3$, $2\pi r^2h/3$,
$\pi r^2h$, and $4\pi r^2h/3$ respectively.


Section 3.6 

1. 1527/10. 

3. me" V/1 + 2" — V2 + sinh‘ e” — sinh! 1). 
5. i" (s7v37 - i). 

7. 3305. 

9. a (29V145 - 2V10). 

11. 27 (4sinh4 — cosh4 + 1). 

13. 167/10. 

: (1 +75 +3sinh~* 2). 
17. t|m|\/14+ m?. 

19. ~ 8496. 

21. ~ 150.8. 

23. & 3.395. 


25. A= f 2m (f(e)—w) V1 + Lr @l ae, 


7. A= | a (hw 4) 14 (Pay ae. 


29. = l(a + 4b + 4)9/? — (a? 4 4py°"?]. 


0 m=0 
b. S is neither continuous nor differentiable at 
m= 0. 


S(m) = ae m #0 


33. 
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b. Annulus. a. 1. 
bed . . b. c= 7/3. 
35. One way to do this is to write the equation 
perpendicular to y = ma passing through (x1, y1). 19. 
Find where the two lines intersect and use the dis- 
tance formula. a. 2sinh1. 
37. A= fatto = 1+ f(a) de. 
e /1 + m? VA f'(x) | 5 L a nee 
39. _ 2 n 5 e } . 
a. A= f° Inf(x)/1+ f(a)? dx. Here we as- 23. ae 
sume f(x) > 0. 
ae 25. 45/2. 
b. A = =f 2n(f(z) — 2) V1 + f(a) de. 97, = 14.689 ke. 
We assume f(x) > 2, or that f lies about 99 ¢ 6/5. 


the line y = a. 


c A= / Qrarv/1-+ f'(a)2 dx. We assume 
z>0.- 
41. 167. 
43. 167. 


5 e® — 34 
45. / Qn »-V14+ e2" dz. 
2 Vv 10 


Qn ft ( 
47. — 2 
V5 Je 


8 
49. an 


Section 3.7 
1. 18sinh5. 

3. 7207. 

5. 83, 5927/5. 
7. 34567/35. 


sO 


yo/m 
| 6(x)272(yo — max) dx. 
0 


b 
1d, ach f x6(x) dx. 
0 


13. 27(491n 49 — 25 1n 25 — 24). 
15. 


a. 36. 
b. © =—-—7+ V85. 


17. 


31. 22.5 kg/m?. 


33. Numerator follows from comments in this 
chapter. Denominator follows from the shell vol- 
ume formula. 


35. 103.4 kg/m’. 
i. 
a. 1177/t. 
b. —1177/t?. 
39. m(t — 1)’?(16+9In9(In9—2)) + 
3nt(In 9)? In 81. 
41. 2n(5+1n6). 
43. rme*(e?? — 1). 


45. 783/2. 


1 
47. ae =e = 41), 


49. 844/3. 


Section 3.8 

1. (4/19, 10/19). 

3. (3,3). 

5. (2/27, -1/9, —2/9). 
7 


a. 25225/12. 
b. 1009/192. 


2 
a. 57 (660 + 7In(17/5)). 
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2(660 + 7 In(17/5)) 


3(48 + 7In(17/5)) | 


11. 


a. 


(In 5)? 


b. 
Aln 


13. Density functions should not take on negative 


values. 


15. 


1/3. 
11/60. 
c. 3/20. 
11/20. 
e. 9/20. 


T 2 


oO 


17. 


6 


c. 13/8. 
0. 


o 


—4+131n5 


5 


25(—4 + 13In5) 


(-21v3 + 3V15 — 20m + 768 sin” *(1/4)) : 


156 


19. 


2401/6. 


a 2 


4802/15. 


c. 31213/30. 


oO 


4/5. 
e. 13/5. 


—20n — 21/3 + 3V15 + 768° 


21. (27/16, 39/20). 
23. (0,0,3/4). 


25. Substitute $g(x) = 0$ and cancel out $\delta(x)$ in the
formulas for $\bar{x}$ and $\bar{y}$.


33. (6,19). 


35. 


(0, 


37. 


39. 
Al. 


43. 


45. 


47. 


49. 
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0, 
( 6/55 — 97 + 128sin—!(3/8) 


5.044). 


7 —4e3 _ 
8 — 2e3’ 2e3 — 8 


(t/3,t/3). 
(0, 5t/17). 


e® 


7 


) 


~ 


) ~ (2.27975, 12.3225). 


(24 6cost sin2t+ 6sint 


14 


a. pi (t) = 


(—2 — (3/4)t?,2), pa(t) 


? 


14 


). 


(7/4)t?, —3), a(t) = (4,14 #7). 


2 2 
b. (: +7 14-2¢ 


4 5) 


4 


c. — #1.1. 


2 
d. 8/5 N. 


e. Same as (c). 


3/2 
10 © 

v2 
10° 


), 


(1+ 


Section 3.9 


1. 
3. 


5. 


225 joules. 
4/15 joules. 


5(e® — 1) 
4el2 
Yes. 


0 joules. 


joules. 


—4.21875m joules. 


k = 224, 


k = —80/In2. 


k =9/61. 


0 joules. 


2592 joules. 
44, 100 joules. 


4575 ft-lb. 
27 
hPo — ne 


54 joules. 


ft-lb. 
31.

$W = \frac{1}{2}(mv_1^2 - mv_0^2) + mg(h_1 - h_0)$.

This is the sum of the changes in the kinetic and
potential energies.


33. 
35. 
37. 
39. 


41. 


63/4 joules. 
—12 joules. 
0 joules. 


3 (30 — z)(62.4)(160) dz ft-lb. 


39(30 — z)(62.4) (152°) dz ft-lb. 


225 


Section 3.10 


1. 
3. 
5. 
a 
9. 


37440 lb. 
13, 720, 000/3 N. 
7530.4 lb. 
Sw H? 

a 


You can either show that the force on one half 


of the wall is half the total force, or show directly 
that the forces on the two halves are equal. 


Section 4.1 


1. 


a. $x_0 = 0$: $p(x_0) = 4$; $p^1(x_0) = 4$; $p^2(x_0) = 4$;
errors are all 0.

b. $x_0 = 0.1$: $p(x_0) = 3.751$; $p^1(x_0) = 3.7$;
$p^2(x_0) = 3.75$; errors: 0.051 and 0.001.

c. $x_0 = 0.01$: $p(x_0) = 3.970492$; $p^1(x_0) = 3.97$;
$p^2(x_0) = 3.9705$; errors: 0.000492 and 0.000008.

d. $x_0 = -0.001$: $p(x_0) = 4.0030050091$;
$p^1(x_0) = 4.003$; $p^2(x_0) = 4.003005$; errors:
0.0000050091 and 0.0000000091.


a. to =1: p(so)=7; p' (to) =7; p’ (xo) = 7; 


errors are all 0. 


b. ao = 1.1: p(ao) = 7.219985; p' (ao) = 


7.1; p’ (xo) = 7.22; errors: 0.119985 and 
0.000015. 


c. 2 = 1.01: p(x) = 7.0112; p'(xo) = 


7.01; p?(ao) = 7.0112; errors: 0.0012 and 
1.5(1071?). 


ie 
9. 
11. 


ro = 0.999: p(ao) = 6.999012; p' (ao) = 


6.999; p*(ao) = 6.999012; errors: 0.000012 


and 1.5(107"). 


~to= 0: peo) — 1; po) = 1) Po) = 1 


errors are all 0. 


a = —0.01: — p(xo) = 0.9900990099; 


p'(2o) = 0.99; p?(ao) = 0.9901; errors: 
0.0000990099 and 0.0000009901. 


. 9 = 0.001:  p(ao) = 1.001001001001001; 


p' (ao) = 1.001; p?(xo) = 1.001001; errors: 
0.000001001001001 and 0.000000001001001. 


8 


12(x — 1)’. 
(x) = (© +2) + (w+ 2)”. 


qd 


13. $q(x) = 1000 + 300(x-5) + 30(x-5)^2 + (x-5)^3$.


15. $q(x) = 3 + 10(x-1) + 17(x-1)^2 + 13(x-1)^3 + 4(x-1)^4$.
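Rewriting a polynomial in powers of $(x - a)$ can be done by repeated synthetic division. The sketch below is one way to do it, not necessarily the method intended by the text. The original polynomials are not restated in this answer key; the example input $x^3 + 15x^2 + 75x + 125 = (x+5)^3$ is used only because its recentered coefficients at $a = 5$ match those listed in answer 13.

```python
def recenter(coeffs, a):
    """Return coefficients of the same polynomial in powers of (x - a).

    coeffs lists c0, c1, c2, ... for c0 + c1*x + c2*x^2 + ... (ascending order);
    the result lists d0, d1, d2, ... for d0 + d1*(x - a) + d2*(x - a)^2 + ...
    """
    work = list(reversed(coeffs))  # descending order for synthetic division
    n = len(work)
    result = []
    for k in range(n):
        # One pass of synthetic division by (x - a) on the remaining quotient.
        for i in range(1, n - k):
            work[i] += a * work[i - 1]
        # The remainder of this pass is the next coefficient d_k.
        result.append(work[n - k - 1])
    return result

if __name__ == "__main__":
    # (x + 5)^3 = 125 + 75x + 15x^2 + x^3, recentered at a = 5.
    print(recenter([125, 75, 15, 1], 5))
```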

17. 63.35. 

19. 14. 

21. E*(x) = 0.24 

23. E*(x) = 0.07 

25. E3(x) = 0.0972 

Section 4.2 

i 


. f(0.001) = 1.0010005; 7}(0.001) = 1.001; 


T; (0.001) 
1.0010005. 


1.0010005; 77(0.001) = 


ge x2 at 
_Te(z) = 1-4; THe) = 1-544; 
a? at 2 
T} (x z)=1 2a 6° 


. f(2) = —0.4161468; T}(2) = —-1; T?(2) = 


—0.333; T?(2) = —.4222. 


. f(0.001) = 0.9999995; T; (0.001) = 


0.9999995; T; (0.001) = 0.9999995; 
T; (0.001) = 0.9999995. 
5.
a. $T_f^1(x) = 1 + x$; $T_f^2(x) = 1 + x + x^2$;
$T_f^3(x) = 1 + x + x^2 + x^3$.
b. $f(2) = -1$; $T_f^1(2) = 3$; $T_f^2(2) = 7$;
$T_f^3(2) = 15$.
c. $f(0.001) = 1.001001001$; $T_f^1(0.001) = 1.001$;
$T_f^2(0.001) = 1.001001$; $T_f^3(0.001) = 1.001001001$.
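The values in answer 5 are consistent with $f(x) = 1/(1 - x)$ and its Maclaurin polynomials $1 + x + \dots + x^n$; that reading of $f$ is an inference from the listed numbers, not something this key states. Under that assumption, the sketch below shows why the approximation is excellent at $x = 0.001$ and useless at $x = 2$.

```python
def f(x):
    # Assumed function for this exercise: f(x) = 1 / (1 - x).
    return 1 / (1 - x)

def taylor_geometric(x, n):
    # n-th Maclaurin polynomial of 1/(1 - x): 1 + x + x^2 + ... + x^n.
    return sum(x ** k for k in range(n + 1))

if __name__ == "__main__":
    for x in (2, 0.001):
        print(x, f(x), [taylor_geometric(x, n) for n in (1, 2, 3)])
```

At $x = 2$ the polynomial values grow with $n$ while $f(2)$ is negative, since 2 lies outside the interval $(-1, 1)$ on which the geometric series converges.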


7. Ti (0) =z. 


9. T?(2;1)= 2-214 8H 

11. T?(a;1) = 14+4(e—-1)—4(@-1)? + 3 (2-1)°. 
13. T?(x;1) = 2(@ — 1) — (ew — 1)? + 2(2— 1)*. 
15.

a. $T_f^3(0.1; 0) = 0.1001666$.
b. $|\sinh(0.1) - T_f^3(0.1; 0)| = 0.00000008$.

17.

a. $T_f^2(-0.1; 0) = 1.005$.

b. $|\sec(-0.1) - T_f^2(-0.1; 0)| = 0.0000209$.
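The figures in answers 15 and 17 can be reproduced directly. The sketch assumes the polynomials are $x + x^3/6$ for $\sinh$ and $1 + x^2/2$ for $\sec$, both centered at 0, which matches the listed values.

```python
import math

def sinh_taylor3(x):
    # Degree-3 Maclaurin polynomial of sinh: x + x^3/6.
    return x + x ** 3 / 6

def sec_taylor2(x):
    # Degree-2 Maclaurin polynomial of sec: 1 + x^2/2.
    return 1 + x ** 2 / 2

if __name__ == "__main__":
    err_sinh = abs(math.sinh(0.1) - sinh_taylor3(0.1))
    err_sec = abs(1 / math.cos(-0.1) - sec_taylor2(-0.1))
    print(sinh_taylor3(0.1), err_sinh)
    print(sec_taylor2(-0.1), err_sec)
```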
19. n=1. 
21. c= sin=2 

2 3 

23. TS (a; 1) = 4 a i sory \ arte) 
81(x+1)4 

Ales 
25. TE (t F) = —2@- 3) + me 3) — He 
5) 4+.. 


4d pi 


co §=6(—3)F (a@— 3)* 
am 10kk 


29. T?(t;3) = In(10) 

31. TS°(#;—-1) = 1— 2(@ +1) 4+ (2 +1)’. 

33. T$°(#;1) = oe, Hew 

35, LO (@— a)” 

Section 4.3 

1. a. E¥(x;0) < (0.1)*/(244!)); b. n= 2 (ac 


( 
tually, since a 50) = T7(#;0), n = 1 is a better 
answer); |a| < (2*4!(0. 0001))!/4, 


3. a. oe 1) < (5*(0.1)*)/(4!); bz 
|a — 1| < (4!(0.0001)/54)1/4. 


n=4; ¢. 


5. a. E?(x;1) < (0.1)4/((0.9)*4); b. n=3; . 
|x —1| < 0.1(4"/*)/[1 + 0.1(4"/4)]. 
7. a. E}(e;-1) < 11(0.1)*/2;  b. 


= 4; 
c. need 5(1 + 6)6* < 0.0001; mone 6 = 
(0.00001)*/* works. 


9. a. T?(2;0) = 2-2; b. E}(2;0) < 1/6! 
ll. EF(2;0) <e/(n+1)!. 
13. E?(x;0) < (0.3)"t"/(n + 1)!. 


15. ER(x;0) < (e+e ™*)/(n+1)I. 
17. 0 


25. b. Use $k = 6$: $e \approx 1 + 1 + 1/2! + 1/3! + 1/4! + 1/5! + 1/6! = 2.7180555$.
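A short check of the partial sum in answer 25(b); the function name is illustrative.

```python
import math

def e_partial_sum(k):
    # Sum of 1/j! for j = 0, 1, ..., k.
    return sum(1 / math.factorial(j) for j in range(k + 1))

if __name__ == "__main__":
    print(e_partial_sum(6), math.e - e_partial_sum(6))
```

The printed difference is the error left over after stopping at the 1/6! term.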


Section 4.4 


1. 4th partial sum = 0.58333. 10th partial sum
≈ 0.6456349206. Yes, the 10th partial sum is a
better estimate of ln 2.
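The two partial sums in answer 1 come from the alternating harmonic series $1 - 1/2 + 1/3 - \dots$; a small sketch to reproduce them:

```python
import math

def alternating_harmonic_partial(n):
    # n-th partial sum of 1 - 1/2 + 1/3 - 1/4 + ...
    return sum((-1) ** (k + 1) / k for k in range(1, n + 1))

if __name__ == "__main__":
    for n in (4, 10):
        s = alternating_harmonic_partial(n)
        print(n, s, abs(math.log(2) - s))
```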

3. a. $(1/6)x^3$; b. Calculator: $0.1 - \sin(0.1) = 0.00016658335317$;
Approximation by $(1/6)(0.1)^3 = 0.0001666$.

5. a. −1/2; b. −1/2.


7. Calculator: $(1.09)^{1/3} = 1.0291424665715$;
Approximation by $1 + (1/3)(0.09) - (1/9)(0.09)^2 = 1.0291$.
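The approximation in answer 7 is the first three terms of the binomial series for $(1 + x)^p$ with $p = 1/3$ and $x = 0.09$. A minimal sketch, with an illustrative function name:

```python
def binomial_series_partial(p, x, n):
    # Sum of C(p, k) * x^k for k = 0..n, where C(p, k) = p(p-1)...(p-k+1)/k!.
    total = 0.0
    coeff = 1.0  # C(p, 0)
    for k in range(n + 1):
        total += coeff * x ** k
        coeff *= (p - k) / (k + 1)  # update C(p, k) to C(p, k+1)
    return total

if __name__ == "__main__":
    print(binomial_series_partial(1 / 3, 0.09, 2), 1.09 ** (1 / 3))
```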
9. In the Binomial Theorem, replace x by —x and 
let p = —1 to obtain the Geometric Series Theorem 
with a= 1. 
11. 
a. $T_f^\infty(x; \pi/4) = (1/\sqrt{2}) - (1/\sqrt{2})(x - \pi/4) - (1/\sqrt{2})(x - \pi/4)^2/2! + (1/\sqrt{2})(x - \pi/4)^3/3! + (1/\sqrt{2})(x - \pi/4)^4/4! - \cdots$
b. All $x$.
13. 

yk ‘ 
a. T?°(x;2)=n3 4 ae 2 (o— 2)*. 


b. $\ln(1 + x)$ equals its Taylor series, centered at
2, for $-1 < x < 5$, but a smaller interval, such as
$1 < x < 3$, is what you're expected to find.


15. b. and c. $T_f^\infty(x; 0)$ equals the power series
obtained by multiplying each term of the Maclaurin
series of $e^x$ by $x$.
Section 4.5

1. Converges.
3. Diverges.
5. Diverges.
7. Converges.
9. Converges.


11. Converges. 


13.
a. 1/5.
b. (4/5, 6/5), [4/5, 6/5), (4/5, 6/5], [4/5, 6/5].
c. p(1.1) ≈ 83/144
p(1.01) ≈ 0.050614
p(2) ≈ 224/9

d. We do not have an explicit algebraic function
that we can type in on the calculator.

e. We expect good estimates for x ∈ (4/5, 6/5)
since the remainder term converges to zero
within the radius of convergence.
15. 


a. 1.
b. (−1, 1), [−1, 1), (−1, 1], [−1, 1].

c. p(0.1) ≈ 1403/3000
p(0.01) ≈ 0.49667
p(1) ≈ 4.15


d. We do not have an explicit algebraic function 
that we can type in on the calculator. 


e. We expect good estimates for x € (—1,1) 
since the remainder term converges to zero 
within the radius of convergence. 


17.

a. 7.
b. (−3, 11), [−3, 11), (−3, 11], [−3, 11].

c. p(4.1) ≈ 0.144927
p(4.01) ≈ 0.1430615
p(5) ≈ 0.16618

d. We do not have an explicit algebraic function
that we can type in on the calculator.

e. We expect good estimates for x ∈ (−3, 11)
since the remainder term converges to zero
within the radius of convergence.

19.

a. 1/5.

b. (−∞, ∞).

c. p(−6.9) ≈ 0.097537
p(−6.99) ≈ 0.009975
p(−6) ≈ 85/108

d. We do not have an explicit algebraic function
that we can type in on the calculator.

e. We expect good estimates for all real x.

21.


». de 
b. (−1, 1), [−1, 1], [−1, 1), (−1, 1].

c. p(0.1) ≈ 0.092756
p(0.01) ≈ 0.0099213
p(1) ≈ 0.899661.

d. We do not have an explicit algebraic function
that we can type in on the calculator.

e. We expect good estimates for x ∈ (−1, 1)
since the remainder term converges to zero
within the radius of convergence.


23. That they are mistaken. 


Section 4.6 


1, 
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ol 


11. 


aa mem 


. Taylor approx: 


oo 
(-3 


. Taylor approx: 


7 +42) 


2 3 4 
. 3-52 ss “s +5 +.... Converges on 
(—co, 00). 
. Taylor approx: 2.485, Calculator: 
2.485845413. 
ee (2 — 2)°* Converges on 
k=0 
1 1 
2— —,24+ — 
0- 3n?* wa) 
. Taylor approx: 0.996016, Calculator: 
0.9960159. 


oo 2k+1 

. Converges on (—1, 1). 
k=1 

0.00784427, Calculator: 
0.0078441426. 


(x ob 2) eee 


hal Converges 


on 


+i. 1). 


—0.0996687, Calculator: 
—0.0996687. 


co 


9 x”. Converges on (—1, 1). 
k=0 


. Taylor approx: 0.76, Calculator: 0.7513148. 


x—4 (x —4)? (e-4)?  («#-4)* , 


eae kt a 720 © 40320 ' 
.. Converges on [4, 00). 
. Taylor approx: 0.995004 , Calculator: 


0.995004. 


The radius of convergence and center of the
series in Exercises 13-21 of this chapter can
be found in the solutions to Exercises 13-21 of
Chapter 4.5. By the theorems in this chapter,
the radius of convergence is the same
for the differentiated and integrated series.


13; 
ised 5*( ert 
[oe eG aie 
soul 5*( a4 
k=1 
15. 
ak 
)d a 
= C= o+C1 i(k +2) 
k=0 
eo gk-t 
= ead) “aD 
17. 
_ < (g = 4)**4 
[ Peyan= 0+ >) ye 
i — k(a — 4)*-1 
Pp («) = ye 7k+1 
k=1 
19. 
_ = kei (a + 7)*** 
[ loyan= 0+ 3 1) (k-+ 1) 
oy rei (e+ 7)*7* 
p (x) df 1) yRaI 
21. 
i ght 
p(x) dx =C +5 (-1)*** 
/ D. (k+1)Wk 
Coa yh eels k-1 
k=2 
x 5a? Tx? 474 
286 Ada oh cag agg 
x x 1324 
D5 ewes 
2 Ra hen ogee 
27 (c — 3) (=+) Gras BY sk 
da 1 Par ga 1 1 
3)°4 x 
“v3 a) °-9 Ws =~ 3) 
A Ts a 1 : 3 
29. 1+ Meee en 
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3 4 
ux 6 + 6 
33. 
2 
Go £ 
35. l-=+—. 1 
2 2 a. bra = i k>1, 
5 
a eS. 
2 b. 
39. 
bo = 1 
a. Idea: an even series converges at a iff the —_ 
series converges at —a. i 
b. Straightforward, but keep in mind there are ba = 31/2 
four things to prove. b3 = 26 
ba = 781/24 
Al. : / 
bs = 651/20 
e504 git be = 19531/720 
: | 3 
tay (6 + 12k!) by = 4069/210 
b. 1/2. bg = 488281 /40320. 
43. 
a. Follow the hint. 
2° ol ji c. © 1.7576. 
b. a 
» (kK—Dlp— By!” 
c. Multiply the result of (b) by 14+ a. 
d. Separate the variables and integrate. 5be + =i 
a. br+i k 1 ry k = 3, 
45. $\cos(3\theta) = \cos^3\theta - 3\cos\theta\sin^2\theta$; $\sin(3\theta) = 3\cos^2\theta\sin\theta - \sin^3\theta$.
b.
bo = 2 
Section 4.7 boa 
1. by = 55/2 
ie 5bk eee. b3 = 277/6 
k+1 by = 1391/24 
be bs = 6967/120 
bo = 2 bs = 6971/144 
bi = 10 by = 34861/1008 
bz = 51/2 bs = 871567/40320. 
b3 = 85/2 
ba = 425/8 
Oy = 25/8 c. 17227/6000. 
bg = 2125/48 


b7 = 10625/336 
bg = 53125 /2688 
c. ≈ −0.900308.


4b, + 2 
. bes = kl > 4. 
Th (ea eee 


b. 


bo =1 

by =0 

bo =3 

bs = 1/3 

by = 13/12 
bs = 1/12 

be = 53/360 
by = 1/120 
bs = 71/6720. 


c. & 1.030442. 


10x + 32 (e°* —1 


25 


5a). 


13. y(x) = Te?) where p(x) = (x? /3) + 2x3 + 
4x + 8/3. 


b(1 — e** + ax 
15. y(x) = ( 2 ) 
_ 1 2a —22 
19. p(x) =1+24 So 50° 
Section 5.1 


1. Converges to 1. 


3. Converges to 1.
5. Converges to 0.
7. Converges to ln 2.
9. Converges to 1.
11. Converges to e°. 
13. Converges to 0. 
15. Converges to 0. 


17. Use 2/n? and 12/n? as pinching sequences. 
19. Use $\frac{4n-3}{n}$ and $\frac{4n+3}{n}$ as pinching sequences.


21. Let an = sin (492) and ba = sin (S252),


23. Use the subsequences 1/2,2/3,3/4,... and 
ey ey 


25. 
a. No, the sequence need not converge. Let 
$a_n = 5 + (-1)^n$.
b. Yes. 


c. Yes, since the sequence is monotonic and 
bounded. 


27. Idea: If $a_n + b_n$ converges and $a_n$ converges,
then $b_n = (a_n + b_n) - a_n$ converges.
29. Idea: If $b_n/a_n$ converges and $a_n$ converges,
then $b_n = \frac{b_n}{a_n}\cdot a_n$ converges.


Section 5.2 

1.

$s_1 = 1/1 = 1$

$s_2 = 1 + 1/4 = 5/4$

$s_3 = 1 + 1/4 + 1/7 = 39/28$

$s_4 = 1 + 1/4 + 1/7 + 1/10 = 209/140$

$s_5 = 1 + 1/4 + 1/7 + 1/10 + 1/13 = 2857/1820$.

3.

$s_0 = 1$

$s_1 = 1 - 2 = -1$

$s_2 = 1 - 2 + 4 = 3$

$s_3 = 1 - 2 + 4 - 8 = -5$

$s_4 = 1 - 2 + 4 - 8 + 16 = 11$
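These partial sums can be reproduced exactly with rational arithmetic. The term formulas $1/(3k - 2)$ for $k \ge 1$ and $(-2)^k$ for $k \ge 0$ are inferred from the sums listed above, not quoted from the exercises.

```python
from fractions import Fraction
from itertools import accumulate

def partial_sums(terms):
    # Running totals s_1, s_2, ... of a finite list of terms.
    return list(accumulate(terms))

if __name__ == "__main__":
    first = partial_sums(Fraction(1, 3 * k - 2) for k in range(1, 6))  # assumed terms
    second = partial_sums((-2) ** k for k in range(5))                 # assumed terms
    print([str(s) for s in first])
    print(second)
```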
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5. c. Converges to 2. 
iy ok, 3 
gers a 17. 
eS ta A 
so = (1-1/4) + (1/4—1/9) = 8/9 a. bg = 1/2, b4 = —1/3, bs = —1/12, be = 
s3 = (1-1/4) + (1/4 — 1/9) + (1/9 — 1/16) = 15/16 —1/30, br = —1/60 
—2 ee 
b. bj G-Da- GLa if i > 3. bs = 1/2. 
Similarly, s4 = 24/25 and s5 = 35/36. c. Converges to 0. 
re 19. 
s-3 = (-8)? =9 ich = Spe SS Say S SS 
$45 (23) 4 (2) S418 —1/4, bs = —3/20 
s_1 = (—3)? + (-2)? + (-1)? = 14 b. b= ay HED Ly =5. 
so =s1+0°=14 


c. Converges to 2. 


21. Diverges. 


9. 23. Diverges. 
Cee eee 25. Diverges. 
2 2 3 27. Converges to 7/12. 
$= -+ == 
1 2 29. Converges to —7/4. 
Qi 2h 2 
s6=7TT5 + 3° 11/3 31. Converges to —1/4. 
De De 2D) In2 
= +-~=2 : -—. 
aa yt ag 5/6 33. Converges to 10 
a a a? 
= jaf 35. Converges to 73/12. 
88 1 5 T 3 T 4 T 5 137/30. 
37. Diverges. 
1. 39. Diverges. 
a. by ljbo = —1/2,b3 = -1/6,b4 = 41. 767/3333. 
—1/12, bs = —1/20. 43. 271801/99990. 
My ae. 
b. 6: = Gai) ifi>1.b =1. 45. Converges to 20. 
c. Converges to 0. 47. Diverges. 
49. 
13. 2 
a. The subsequences don’t converge to the 
a. bo = 1, by = 2, be = 2, b3 = 2, b4 = same value. 
ps b. Aft ing pai h 
. : er grouping pairs, we have: 
b. bo = 1, b; = (—1)°- 2 fori >0. - . 4 
c. Diverges. a 4 8 aa 
15. which converges. 
ae Obj. = “es bs = 2- S — 51. Answers may vary. It shows the terms of a se- 
oa — e? ries may converge to zero while the series diverges. 
— 2e — 3e 
ae? ba = ee 53. Compare with 1/(3k), for example. 


b. b= SEY itis 0. bo = 2. 55. 
b. 2.


c. The dog runs twice as fast as the man, and 
so covers twice the distance. 


Section 5.3 


1. Diverges by the p-Series Test. 


31. Converges. 
33. Diverges. 
35. Diverges. 
37. Converges. 
39. Converges. 
41. Converges. 
43. Diverges. 


45. Hint: one can use an argument similar to the 
proof of the ratio test. 


47. Apply the natural logarithm to the second
formula in Corollary 5.3.30.


3. 
a. Converges by the p-Series Test. 
b. Using inequalities from p-Series Test, be- 
tween 49 — 7 (1+2-*/7 +3-*/") and 56 
7 (142-87 4374/7), 
5. Diverges. 
7.
a. Converges by the p-Series Test. 
b. Between 5.5 and 13.5. 
9. Diverges. 


2 
11. Compare with the convergent series > Re 


pt+l1l 


13. Compare with the divergent series > 7 


4 
15. Compare with the convergent series y = 

m 

2 
17. Compare with the divergent series y pr: 
19. Compare with the convergent series y — 


1 
21. Diverges. Limit compare with y rs 
23. Diverges. Limit compare with y (p+ 1). 
1 
25. Converges. Limit compare with y — 
m 


27.  Diverges. Limit compare with :3 us a 
pi/3 
k1/2° 


1 
29. Converges. Limit compare with es 5K 


Jk 
oS 


Section 5.4 

1. Converges absolutely. 

3. Converges absolutely. 

5. Diverges. 

7. Diverges. 

9. Converges absolutely. 

11. Diverges. 

13. Converges conditionally. 


15. Converges absolutely. 


17. Converges by Theorem }5.4.11] co = 1, a1 = 
—1/4, co = 23/72, ca & 0.183287, cs = 139/2000. 


19. Converges by Theorem 5.4.11. $c_0 = 1$, $c_1 = 0$,
$c_2 = 0$, $c_3 = 0$, $c_4 = 0$, $c_5 = 0$.
9999 1 

21. a= 
a i 
99 1 

23. =) 
dA ) Vk24+1 


25. Converges conditionally. 
27. Diverges. 


29. The final step of both parts of the theorem
follows by the Cauchy criterion and the fact that
the series $\sum a_n$ and $\sum b_n$ converge absolutely.

31. Let an =bn = c. 

33. Unlike finite sums, infinite sums are not, in 
general, commutative. The order in which terms 
are summed matters. 
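A standard illustration of answer 33 (not necessarily the example used in the exercise) is the alternating harmonic series: summed in its usual order it converges to $\ln 2$, but rearranged as two positive terms followed by one negative term it converges to $\frac{3}{2}\ln 2$. A numerical sketch:

```python
import math

def usual_order(n_terms):
    # 1 - 1/2 + 1/3 - 1/4 + ... summed in the usual order.
    return sum((-1) ** (k + 1) / k for k in range(1, n_terms + 1))

def rearranged(n_blocks):
    # Same terms, reordered in blocks: 1 + 1/3 - 1/2 + 1/5 + 1/7 - 1/4 + ...
    total = 0.0
    for j in range(1, n_blocks + 1):
        total += 1 / (4 * j - 3) + 1 / (4 * j - 1) - 1 / (2 * j)
    return total

if __name__ == "__main__":
    print(usual_order(100000), math.log(2))
    print(rearranged(100000), 1.5 * math.log(2))
```

This is possible only because the series converges conditionally; an absolutely convergent series has the same sum under every rearrangement.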
35. Converges on (0.8, 1.2] and absolutely at both
endpoints.

37. Converges on [−1, 1] and absolutely at both
endpoints.

39. Converges on [−3, 11). Converges conditionally
at x = −3.

41. Converges on (−7 − e, −7 + e). Diverges at
both endpoints by the Term Test for Divergence.
Index


i-th subinterval of a partition, axis of revolution, 
p-Series Test, 
p-series, [580 base of a cone, [298] 
p-series, alternating, between, 

between, strictly, 
Abel, Niels Henrik, bijection, [575] 
absolute convergence, [598] binomial expansion, [456] 
absolute error, binomial series, [455] 
acceleration, Binomial Theorem, 
acceleration vector, bounded above, 
alternating p-series, bounded below, [537] 
alternating harmonic series, bounded function, [110] 
alternating series, bounded set, 
Alternating Series Test, buoyancy force, [385] 
analytic continuation, real, 
analytic function, real, [462] cardioid, 
angle addition formulas, Cauchy Criterion for Series Convergence, [596] 
annulus, [300] Cauchy product of series, [601] 
anti-derivative, Cauchy sequence, [537] 
anti-derivative on an interval, center of mass, 
anti-differentiation, centered at a, 
arc length, centroid,
arc length function, [266] choose, [456] 
Archimedes Principle, [385] circular trigonometric functions, 
area between graphs, collision, inelastic, [379] 
area function, cross-sectional, collision, totally inelastic, 
area in polar coordinates, Comparison Test, |582 
area inside a circle, completeness of the real numbers, [538] 
area swept out, completing the square, 
associativity for convergent series, [566] complex numbers, 
average density, [345 component functions, 
average value, [132] components of a vector, 


conditional convergence, |455} 
cone, 
Conservation of Energy, |366} 


Conservation of Mass-Energy, |366 
continuous, |619 

continuous summation, [116] 
continuous variable, 
continuous, piecewise, [112]
continuous, uniformly, [203] 
convergence of a sequence, [463] 
convergence of a series, 
convergent integral, 
convergent series, 
converges, 
converges absolutely, 
converges conditionally, 
CORDIC, 


curve, parametric, [276]

curve, rectifiable, 

curve, rectifiable, piecewise-regular, [268] 
curve, simple regular, 


cylindrical shells, [307]


decreasing sequence, [539] 
definite integral, 
density, [334]

density, area-, 
density, average, [345] 
density, length-, 
difference operator, 
differentiable, 
differentiable, infinitely, 
differentiating power series, 
Dirac delta function, [256] 
Dirac, Paul, 

direction of a vector, 
discrete variable, 

disk, 

displacement, 


distance traveled, 
distance traveled function, 
divergent integral, 

diverges, 
dot product, 

dummy variable, 


eccentricity, [292]
Einstein, Albert, 
element of arc length, 


elementary function, 
energy, kinetic, [368] 


energy, potential, 
equilibrium position, [374] 
error, [400] 

error in approximation, [181] 
error, absolute, [181] 

error, Taylor, [426] 

Euler’s Formula, 
Euler, Leonhard, [502] 
evaluation notation, [139] 
eventually, for a sequence, 
exceptional values method, 
extended real number, 
extension of a function, [159] 


factorial, 
Feynman, Richard, 


Fibonacci sequence, 

finite difference, [72]

finite difference function, 

force field, conservative, [372]

Fourier series, [579] 

Fourier, Jean Baptiste Joseph, 
Fundamental Theorem of Calculus, 
Fundamental Theorem of Calculus, Part 1, [143] [208]
Fundamental Theorem of Calculus, Part 2, 


Gabriel’s horn, 


gamma distributed,
gamma function, 
geometric series, 
geometric sum, 

germ of f, 

Golden ratio, 


greatest lower bound, 


harmonic sequence, 


harmonic series, 


harmonic series, alternating, [488]


height of a cone, [298] 
heliocentric coordinates, [292] 
Hooke’s Law, [374] 
hydrostatic pressure, 
hyperbolic cosine, 
hyperbolic sine, 


improper integral, 
increasing sequence, [539]
indefinite integral, 

index of summation, 
induction, mathematical, 
infimum, 

infinite series, [464] 

infinitely differentiable, 
infinitesimal contribution, [116] 


integer interval, 
integrable, 
integrable, Riemann, [108] 
integral function, [140] 
integral on F170 


Integral Test, 
integral, convergent, 
integral, definite, 
integral, divergent, 
integral, indefinite, 
integrand, 


integrating power series, [496]


integration by parts, 

integration by substitution, 

integration by trigonometric substitution, [32]

interior of an interval, 

interval centered at a, [467] 

interval of convergence, 


inverse hyperbolic trigonometric functions, 
iteration equation, 


joule, 


Kepler’s 2nd Law of Planetary Motion, [292] 


Kepler, Johannes, 
Kirchhoff’s Law,


Lagrange, Joseph-Louis, [426] 
lamina, 

Laplace transform, [178] 
Laplace, Pierre-Simon, [173] 
least upper bound, 


least upper bound property, 
left Riemann sum, 


Leibniz, Gottfried Wilhelm, 
length of a curve, 
limaçon, [290]

Limit Comparison Test, 
limits of integration, 

linear combination, 

linearity of anti-differentiation, [6] 
linearity of differences, 
linearity of improper integrals, [171] 
linearity of integration, 
linearity of summation, 
linearization, [412] 

Liouville, Joseph, [21] 

long division of polynomials, 
lower bound, 


lower sum, [202]
Maclaurin polynomial,
Maclaurin series, [414]
Maclaurin, Colin,
magnitude,
mass, [334] [339] [347]
mass midpoint,
matching coefficients,
mean value, [132]
Mean Value Theorem for Integration, [132]
measure zero, [112]
mesh,
Midpoint Approximation,
midpoint Riemann sum,
moment,
monotonic sequence, [539]

Newton’s 2nd Law of Motion, [370] [372] [373]
Newton, Sir Isaac,
non-negative series, [572]

parameterization by arc length,
parameterization with unit speed,
parameterization, rectifiable,
parameterization, simple regular, [260]
parameterized curve,
parameterizes, [260]
partial fractions decomposition,
partial sum,
partial sum of a series,
partition of an interval,
permutation of indices, [575]
piecewise-continuous,
piecewise-simple regular curve, [265]
Pinching Theorem for Sequences, [535]
point-mass, [348]
polar coordinates, [281]
polynomial function,
polynomial, centered,
position,
position function, [249]
position vector,
Power Rule for Integration,
power series,
power series function, [465]
pressure vector, [385]
pressure, hydrostatic, [385]
product of series,
Product Rule for the dot product, [620]

radial vector,
radius of convergence,
radius of interval,
range of the index of summation,
Ratio Test,
rational function, [39]
real analytic continuation, [481]
real analytic function,
rearrangement, [600] [610]
rearrangement of a series, [575]
rectifiable, [268]
refinement of a partition,
refinement of a sampled partition,
remainder,
remainder, Taylor,
repeated linear factor, [47]
repeating decimal,
represented by a power series, [444]
Riemann sum,
Riemann sum, left,
right Riemann sum,
Root Test,
rose, 3-leaved,

sample point,
sampled partition,
scalar,
scalar multiplication,


Second Derivative Test,
separation, totally inelastic, 
sequence, [92] [445] [463] [537]
sequence, Cauchy, [537]
sequence, decreasing, [539]
sequence, harmonic,
sequence, increasing, [539]
series of constants, 
series, infinite, [464] 

series, non-negative, [572] 
shells, cylindrical, 
shifting indices, 

sigma notation, 
Simpson’s Rule, 
slug, [336]

solid of revolution, [300] 
specific weight, 

speed, 
splitting summations, 
spring, [374]

spring constant, 


subsequence, 
subseries, [573] 


subset, 
substitution in definite integrals, 


sum of a series, 
summation, 


supremum, [202]
surface of revolution, 


tautological initial data, [9] 
Taylor polynomial, 
Taylor series, 

Taylor series function, [445] 
Taylor’s Theorem, [426] 


Taylor’s Theorem with Integral Remainder, [442] [443]


Taylor, Brook, 
Taylor-Lagrange Theorem, [211]


telescoping series, 
telescoping sum, 
Term Test for Divergence, [559] 


terms of a series, [464]
Third Derivative Test, 


Trapezoidal Rule, 
triangle inequality, [601]
trifolium, [288]


unbounded, 
uniformly continuous, 


unit vector, 

universal gravitational constant, 
upper bound, 

upper sum, [202] 


variance, [157] 


vector, [615] 

vector addition, 
vector-valued function, 
velocity, [9] 


velocity vector, 
vertex of a cone, [298] 


volume, 
volume inside a sphere, [299] 


washer, [300] 


work, 
work done against, 


zero vector, [615] [618] 


